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accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 24 0 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360 
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 420 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 4 80 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 54 0 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
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ucccgauucg cagcgcaucg ccuucuaucg 
ccucucccuc cccccccccu aacguuacug 
cguuugucua uauguuauuu uccaccauau 
aaccuggccc ugucuucuug acgagcauuc 
ugcaaggucu guugaauguc gugaaggaag 
caacgucugu agcgacccuu ugcaggcagc 
gcggccaaaa gccacgugua uaagauacac 
uugugaguug gauaguugug gaaagaguca 
ggcugaagga ugcccagaag guaccccauu 
caugcuuuac auguguuuag ucgagguuaa 
acgugguuuu ccuuugaaaa acacgaugau 
caaacacgag gccuccuggg cgccauagug 
caggccgggg aaguccaaau ccuguccaca 
ucggggguuu uguggacugu uuaccacgga 
gguccgguca cgcagaugua cucgagugcu 
ccugggacca agucuuugga gccgugcaag 
cggaacgcug augucauccc ggcucggaga 
ccgagaccca uuucgaccuu gaaggggucc 
cacgucguug ggcucuuccg agcagcugug 
uucauccccg uugagacacu cgacguuguu 
acgccaccgg cugugcccca gaccuaucag 
ggaaagagca ccaagguccc ugucgcguau 
aaccccucgg uagcugccac ccugggguuu 
aaucccaaca uuaggacugg agucaggacc 
acauauggca aauuucucgc cgaugggggc 
ugcgaugaau gccacgcugu ggaugcuacc 
caagcagaga cagccggggu cagacuaacu 
gugacaaccc cccaucccga uauagaagag 
uucuauggga gggcgauucc ccuauccugc 
cacucaaaga aaaaguguga cgagcucgcg 
guggcauacu auagaggguu ggacgucucc 
gucgccaccg acgcccucau gacgggguac 
aauguagcgg ucacccaagc ugucgacuuc 
cagacugucc cacaagacgc ugucucacgc 
agacagggca cuuauaggua uguuuccacu 
guagugcuuu gugagugcua cgacgcaggg 
accaccguca ggcuuagagc guauuucaac 
cuugaauuuu gggaggcagu uuucaccggc 
caaacaaagc aagcggggga gaacuucgcg 
gccagagcca aggccccucc cccguccugg 
aagccuacgc uugcgggccc cacaccucuc 
gucacccuca cacacccugg gacgaaguac 
gucaugacca gcacgugggu ccuagcugga 
cuggcgacug gaugcguuuc caucaucggc 
gcgccggaua aggagguccu guaugaggcu 
gcggcucuca ucgaagaggg gcagcggaua 
uugcugcagc aggccucuaa gcaggcccag 
cccaaagugg aacaauuuug ggccagacac 
cucgcaggau ugucaacacu gccagggaac 
gccgcccuca ccaguccguu gucgaccagu 
ugguuagcgu cccagaucgc accacccgcg 
gugggggcug ccgugggcag cauaggccug 
uauggugcgg gcauuucggg ggcccucguc 
ucuauggaag augucaucaa ucuacugccu 
ggggucaucu gcgcggccau ucugcgccgc 
uggaugaaca ggcuuauugc cuuugcuucc 
gugacggagu cggaugcguc gcagcgugug 



ccuucuugac gaguucuucu gaguuuaaac 1200 
gccgaagccg cuuggaauaa ggccggugug 1260 
ugccgucuuu uggcaaugug agggcccgga 1320 
cuaggggucu uuccccucuc gccaaaggaa 1380 
caguuccucu ggaagcuucu ugaagacaaa 14 40 
ggaacccccc accuggcgac aggugccucu 1500 
cugcaaaggc ggcacaaccc cagugccacg 1560 
aauggcucuc cucaagcgua uucaacaagg 1620 
guaugggauc ugaucugggg ccucggugca 1680 
aaaaacgucu aggccccccg aaccacgggg 1740 
accauggcuc ccaucacugc uuaugcccag 1800 
gugaguauga cggggcguga caggacagaa 1860 
gucucucagu ccuuccucgg aacaaccauc 1920 
gcuggcaaca agacucuagc cggcuuacgg 1980 
gagggggacu ugguaggcug gcccagcccc 2040 
uguggagccg ucgaccuaua ucuggucacg 2100 
cgcggggaca agcggggagc auugcucucc 2160 
ucgggggggc cggugcucug cccuaggggc 2220 
ugcucucggg gcguggccaa auccaucgau 2280 
acaaggucuc ccacuuucag ugacaacagc 2340 
gucggguacu ugcaugcucc aacuggcagu 2400 
gccgcccagg gguacaaagu acuagugcuu 24 60 
ggggcguacc uauccaaggc acauggcauc 2520 
gugaugaccg gggaggccau cacguacucc 2580 
ugcgcuagcg gcgccuauga caucaucaua 2640 
uccauucucg gcaucggaac gguccuugau 2700 
gugcuggcua cggccacacc ccccggguca 27 60 
guaggccucg ggcgggaggg ugagaucccc 2820 
aucaagggag ggagacaccu gauuuucugc 2880 
gcggcccuuc ggggcauggg cuugaaugcc 2940 
auaauaccag cucagggaga uguggugguc 3000 
acuggagacu uugacuccgu gaucgacugc 3060 
agccuggacc ccaccuucac uauaaccaca 3120 
agucagcgcc gcgggcgcac agguagagga 3180 
ggugaacgag ccucaggaau guuugacagu 3240 
gcugcguggu acgaucucac accagcggag 3300 
acgcccggcc uacccgugug ucaagaccau 3360 
cucacacaca uagacgccca cuuccucucc 3420 
uaccuaguag ccuaccaagc uacggugugc 3480 
gacgccaugu ggaagugccu ggcccgacuc 3540 
cuguaccguu ugggcccuau uaccaaugag 3600 
aucgccacau gcaugcaagc ugaccuugag 3660 
ggaguccugg cagccgucgc cgcauauugc 3720 
cgcuugcacg ucaaccagcg agucgucguu 3780 
uuugaugaga uggaggaaug cgccucuagg 384 0 
gccgagaugu ugaaguccaa gauccaaggc 3900 
gacauacaac ccgcuaugca ggcuucaugg 3960 
auguggaacu ucauuagcgg cauccaauac 4020 
cccgcggugg cuuccaugau ggcauucagu 4080 
accaccaucc uucucaacau caugggaggc 414 0 
ggggccaccg gcuuugucgu caguggccug 4200 
gguaaggugc ugguggacau ccuggcagga 4 2 60 
gcauucaaga ucaugucugg cgagaagccc 4320 
gggauccugu cuccgggagc ccugguggug 4380 
cacgugggac cgggggaggg cgcgguccaa 4440 
agaggaaacc acgucgcccc uacucacuac 4500 
acccaacuac uuggcucucu uacuauaacc 4560 
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agccuacuca 
uccuggcucc 
cugaccucua 
uacaagggug 
aucucuggca 
accuggcagg 
cccacgaacu 
cagcaugggu 
caacuaccuu 
cccacaccaa 
gcugucgggu 
cuaacagauc 
ccuccaucug 
ugcaccaccc 
ggcggugugg 
auggccgagg 
agcggguuuc 
gaaucgugga 
cccaagaagg 
accauaucag 
ggugaugcag 
ggugagccgg 
ccuggagauc 
gggguagcuc 
accgugugcu 
gaagaggaaa 
guguacugua 
acgcaagugc 
aaggucagcg 
gcaagaucca 
aaccacauca 
accaucaugg 
gcucgccuca 
gacauuacac 
ccugcccaac 
uuuucguaug 
gaguccauau 
acugagagac 
agacguugcc 
gugaaagccc 
ggcgaugacc 
agagccuuca 
gaauaugacc 
cggggccgcc 
ugggaaacag 
ccaaccauau 
gacacccugg 
uuggaccuuc 
uacucucacc 
cucagggugu 
aaagcggccg 
acuccauugc 
gggggcgaca 
cuccuacuuu 
uagguacacu 
uuuuuuuuuu 
uuucuuggug 



gaagacucca 
gcgacgugug 
aauuguuccc 
ugugggccgg 
auguccgccu 
ggaccuuucc 
acaagaccgc 
cguacuccua 
cuccagaguu 
agccguuuuu 
cccagcuucc 
cgccccacau 
aggcgagcuc 
acagcaacac 
cucagacaga 
aagagagcga 
cacgggccuu 
ggaggccaga 
ccccgacgcc 
aagcccucca 
gcucguccac 
cccccucaga 
cggaccugga 
ccgguucggg 
gcuccauguc 
aguugccaau 
caacaucaaa 
ucgacgccca 
caaggcuccu 
aguauggauu 
aguccgugug 
ccaaaaauga 
ucguuuaccc 
aaaagcuucc 
ggguggagua 
auacccgaug 
accaggccug 
uuuacguagg 
gcgccagcgg 
uagcggccug 
uaguagucau 
cggaggccau 
uggagcuaau 
gcagauacua 
uuagacacuc 
ggguucgcau 
accagaaccu 
cagccauaau 
acgaacugac 
ggaagagucg 
uuugcggccg 
cggaggcgcg 
uuuuucacag 
ucguaggggu 
ccauagcuaa 
cuuuuuuuuu 
gcuccaucuu 



caauuggaua 
ggacuggguu 
caagcugccc 
cacuggcauc 
gggcucuaug 
uaucaauugc 
caucuggagg 
uguaacagga 
uuucuccugg 
ccgggaugag 
cugugaaccu 
cacggcggag 
cucagugagc 
cuaugacgug 
gccugagucc 
ccuugagccc 
accggcuugg 
uuaccaaccg 
ucccccaagg 
gcaacuggcc 
gggggcgggc 
gacagguucc 
gucugaucag 
cucggggucu 
auacuccugg 
caacccuuug 
gagcgccuca 
uuaugacuca 
caccuuggag 
cggggccaag 
gaaggaccuc 
gguguucugc 
ugaccucggc 
ucaggcggua 
ucucuugaaa 
cuucgacuca 
cucccugccc 
agggcccaug 
ggugcuaacc 
caaggcugcg 
cucagaaagc 
gaccagguac 
aacauccugu 
ccugaccaga 
cccuaucaau 
gguccuaaug 
caacuuugag 
ugagagguua 
gcggguggcu 
ggcucgcgca 
auaucucuuc 
ccuacuggac 
cgugucgcgc 
aggccucuuc 
cuguuccuuu 
uuuuucccuc 
agcccuaguc 



acugaggacu 
ugcaccaucu 
ggccuccccu 
augaccacgc 
aggaucacag 
uacacggagg 
guggcggccu 
cugaccacug 
guggacggug 
gucucguucu 
gagcccgacg 
acugcggcgc 
cagcuaucag 
gacauggucg 
agggugcccg 
ucaauaccau 
gcacggccug 
cccaccguug 
agacgccgga 
aucaagaccu 
gccgccgaau 
gccuccucua 
guagagcuuc 
uggucuacuu 
accggggcuc 
aguaacucgc 
cagagggcua 
gucuuaaagg 
gaggcgugcc 
gagguccgca 
cuggaagacc 
guggaccccg 
guccgggucu 
augggagcuu 
gcaugggcgg 
accgucacug 
gaggaggccc 
uucaacagca 
acuagcaugg 
gggauaguug 
caggggacug 
ucugccccuc 
uccucaaaug 
gacccaacca 
ucauggcugg 
acacacuucu 
auguauggau 
cacgggcuug 
ucagcccuca 
gucagggcgu 
aauugggcgg 
uuauccaguu 
gcccgacccc 
cuacuccccg 
uuuuuuuuuu 
uuucuucccu 
acggcuagcu 



gccccauccc 
ugacagacuu 
ucaucucuug 
gcugcccuug 
ggccuaaaac 
gccagugcgc 
cggaguacgc 
acaaucugaa 
ugcagaucca 
gcguugggcu 
cagacguauu 
ggcgcuuggc 
caccgucgcu 
augccaaccu 
uucuggacuu 
cggagugcau 
acuacaaccc 
cugguugugc 
cagugggucu 
uuggccagcc 
ccggcggucc 
ugcccccccu 
aaccuccccc 
gcuccgagga 
uaauaacucc 
uguugcgaua 
aaaagguaac 
acaucaagcu 
aguugacucc 
gcuuguccgg 
cacaaacacc 
ccaagggggg 
gcgagaaaau 
ccuauggcuu 
aaaagaagga 
agagagacau 
gcacugccau 
agggucaaac 
guaacaccau 
cgcccacaau 
aggaggacga 
cuggugaucc 
ugucuguggc 
cuccacucgc 
gaaacaucau 
ucuccauucu 
caguauacuc 
acgccuuuuc 
gaaaacuugg 
cccucaucuc 
ugaagaccaa 
gguucaccgu 
gcucauuacu 
cucgguagag 
uuuuuuuuuu 
ucucaucuua 
gugaaagguc 



augcuccgga 
caaaaauugg 
ucaaaagggg 
cggcgccaac 
cugcaugaac 
gccgaaaccc 
ggaggugacg 
aauuccuugc 
uagguuugca 
uaauuccuau 
gagguccaug 
acggggauca 
gcgggccacc 
gcucauggag 
ucucgagcca 
gcuccccagg 
gccgcucgug 
ucuccccccc 
gagcgagagc 
ccccucgagc 
gacguccccu 
cgagggggag 
ccaggggggg 
ggacgauacc 
cuguagcccc 
ccauaacaag 
uuuugacagg 
agcggcuucc 
accccauucu 
gagggccguu 
aauucccaca 
uaagaaacca 
ggcccucuau 
ccaguacucc 
ccccaugggu 
caggaccgag 
acacucgcug 
cugcgguuac 
cacaugcuau 
gcugguaugc 
gcggaaccug 
ccccagaccg 
guugggcccg 
ccgggcugcc 
ccaguaugcu 
caugguccaa 
cgugaauccu 
uaugcacaca 
ggcgccaccc 
ccguggaggg 
gcucaaacuc 
cggcgccggc 
cuucggccua 
cggcacacac 
uuuuuuuuuu 
uucuacuuuc 
cgugagccgc 



4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
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augacugcag agagugccgu aacuggucuc ucugcagauc augu 



8024 



<210> 2 
<211> 8024 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 2 

acccgccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgucua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
ugcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa aucccaaacc 360 
ucaaagaaaa accaaaagaa acacuaaccg ucgcccaaug auugaacaag auggauugca 420 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 54 0 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 12 60 
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 174 0 
acgugguuuu ccuuugaaaa acacgauaau accauggccc ccaucaccgc uuacgcccag 1800 
cagacacgag gucucuuggg cucuauagug gugagcauga cggggcguga caagacagaa 18 60 
caggccgggg agguccaagu ccuguccaca gucacucagu ccuuccucgg aacauccauu 1920 
ucgggggucu uauggacugu uuaccacgga gcuggcaaca agacacuagc cggcucgcgg 1980 
ggcccgguca cgcagaugua cucgagcgcc gagggggacu uggucgggug gcccagcccu 204 0 
ccugggacca aaucuuugga gccguguacg uguggagcgg ucgaccugua uuuggucacg 2100 
cggaacgcug augucauccc ggcucgaaga cgcggggaca agcggggagc gcugcucucc 2160 
ccgagacccc uuucgaccuu gaaggggucc ucggggggac cugugcuuug cccuaggggc 2220 
cacgcugucg gaaucuuccg ggcagcugug ugcucucggg guguggcuaa guccauagau 2280 
uucauccccg uugagacgcu cgacaucguc acgcggucuc ccaccuuuag ugacaacagc 234 0 
acaccaccag cugugcccca gaccuaucag gugggguacu ugcacgcccc cacuggcagu 2400 
ggaaaaagca ccaagguccc cgucgcguac gccgcccagg gguauaaagu gcuggugcuc 24 60 
aaucccucgg uggcugccac ccugggauuu ggggcguacu uguccaaggc acauggcauc 2520 
aaccccaaca uuaggacugg agucagaacu gugacgaccg gggagcccau uacauacucc 2580 
acguauggua aauuccucgc cgaugggggc ugcgcaggcg gcgccuauga caucaucaua 264 0 
ugcgaugaau gccacucugu ggaugcuacc acuauucucg gcaucgggac aguccuugac 2700 
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caagcagaga 
gugacaaccc 
uucuauggga 
cacucaaaga 
guggcauauu 
guugccaccg 
aacguagcgg 
cagacugucc 
agacugggca 
guaguacucu 
acgaccguca 
cuugaguuuu 
cagacaaagc 
gccagggcca 
aagcccacgc 
gucacccuua 
gucaugacca 
uuagcgaccg 
gcuccggaca 
gcggcucucc 
uuauugcagc 
cccaagaugg 
cucgcaggac 
gccgcccuca 
uggcuggcgu 
gugggagcug 
uauggugcgg 
uccauggagg 
ggagucaucu 
uggaugaaca 
gugacggagu 
agucuacuca 
ucguggcucc 
cugaccucca 
uacaagggcg 
aucucuggca 
accuggcagg 
gcguuaaacu 
cagcacggau 
caacuccccu 
cccacaccaa 
gucgucgggu 
cuaacagacc 
cccccaucug 
ugcaccaccc 
ggcggcguga 
augaccgagg 
aagagguucc 
gaaucgugga 
cccaaaaaga 
accauaggag 
ggcgauucag 
gacgaguugg 
ccuggggacc 
gaggcagcuc 
gucgugugcu 
gaagaggaaa 



cagccggggu 
cccaucccaa 
gggcguuucc 
aaaaguguga 
acagaggguu 
acgcccucau 
ucacccaggc 
cgcaagacgc 
uuuauaggua 
gugagugcua 
ggcucagggc 
gggaggcagu 
agucggggga 
aagcgccccc 
uugugggccc 
cacaccccgu 
gcacgugggu 
gguguguuuc 
aggagguccu 
uugaagaggg 
aagccucuaa 
agcaauucug 
ugucaacacu 
ccaguccguu 
cccaaauugc 
cuguuggcag 
gcauuucggg 
augucaucaa 
gcgcggccau 
ggcuuaucgc 
cggaugcguc 
ggagacuuca 
gcgaugugug 
agcuguuccc 
ugugggccgg 
acguccgcuu 
ggaccuuucc 
ucaagaccgc 
cauaugccua 
cuccagaguu 
agccguuuuu 
cucagcuucc 
caucccauau 
aggcaagcuc 
acgguaggac 
uucggauaga 
aagagggcga 
caccggccuu 
agaggccaga 
ccccgacgcc 
augcccucca 
gccuuuccac 
cucuuucgga 
cagaccugga 
ccggcucgga 
gcuccauguc 
aguugccaau 



caggcuaacu 
uauagaggag 
ccugucuuac 
cgagcucgca 
ggacgucucc 
gacgggguau 
cguagacuuc 
ugucucacgu 
uguuuccacu 
cgacgcagga 
guauuucaac 
uuucaccggc 
aaauuucgca 
cccguccugg 
uacaccucuc 
gacaaaauac 
ccuggcuggg 
caucauuggc 
cuaugaggcu 
gcagcggaua 
acaggcccag 
ggccaaacau 
gccagggaac 
gucaacuagc 
gccacccgcg 
cauaggcuug 
ggcccucguc 
cuugcugccu 
ucugcgccgc 
cuucgcuucc 
gcagcguguc 
caacuggauc 
ggacuggguc 
aaagaugccu 
cacuggcauc 
gggcucuaug 
uaucaauugu 
caucuggaga 
uauaacaggg 
uuucucuugg 
ccgggaugag 
cugugacccu 
cacggcggag 
cucagcgagc 
cuaugaugug 
gucugagucc 
ccuugagccu 
accggcuugg 
uuaccaacca 
uccuccaagg 
acagcuggcc 
gggggcggac 
gacagguucu 
gccugagcag 
cucggggucc 
auauuccugg 
uaacuccuug 



guacuggcca 
guagcccucg 
aucaagggag 
acggcccuuc 
auaauaccaa 
acuggagacu 
agccuggacc 
agucagcgcc 
ggugagcgag 
gcugcuuggu 
acgccuggcu 
cucacacaca 
uacuuaguag 
gacgucaugu 
cuguaccguu 
aucgccacau 
ggagucuuag 
cguuuacaca 
uuugaugaga 
gccgagaugc 
gacauacaac 
auguggaacu 
ccugcugugg 
accaccaucc 
ggggccacug 
gguaaagugc 
gcguuuaaga 
gggauucugu 
caugugggac 
agaggaaacc 
acccaacugc 
acugaggauu 
uguaccaucc 
ggccuccccu 
augaccacac 
agaaucacag 
uauacagaag 
guggcggccu 
cugaccacug 
guggacggag 
gucucguuca 
gagcccgaca 
gcugcagcgc 
cagcugucgg 
gacauggugg 
aaaguggucg 
ucaguaccau 
gcgcggccug 
cccacuguug 
agacgccgga 
aucaaguccu 
gccgccgacu 
accuccucca 
guagagcuuc 
uggucuacuu 
accggggcuc 
agcaacucgc 



cggccacgcc 
gacaggaggg 
ggaggcacuu 
ggggcauggg 
cucaaggaga 
uugacuccgu 
ccaccuucac 
gagggcgcac 
ccucaggaau 
augagcucuc 
ugccugugug 
uagacgcuca 
ccuaucaggc 
ggaagugcuu 
ugggcucugu 
gcaugcaagc 
cagccgucgc 
ucaaccagcg 
uggaggaaug 
ugaaguccaa 
ccgcugugca 
ucauaagcgg 
cuuccaugau 
uucuuaacau 
gcuuuguugu 
ugguggacau 
ucaugucugg 
cuccaggugc 
cgggggaagg 
acgucgcccc 
uuggcucucu 
gccccauccc 
uaacagacuu 
uuaucucuug 
gaugccccug 
gacccaaaac 
gccagugcuu 
cagaguacgc 
acaacuuaaa 
uacaaaucca 
gcguugggcu 
cugagguagu 
ggcguuuagc 
cgccaucgcu 
augccaaccu 
uucuggacuc 
cggaguauau 
auuacaaccc 
cgggcugugc 
cagugggucu 
uuggccagcc 
ccggcgaucg 
ugcccccccu 
aaccuccucc 
gcuccgagga 
uaauaacucc 
uguugcgaua 



ccccgggucg 
ugagaucccc 
gauuuucugc 
cuugaacgcu 
uguggugguc 
gaucgacugc 
uauaaccaca 
ggguagagga 
guuugacagu 
accaguggag 
ccaggaccac 
uuuccuuucc 
cacagugugc 
gacucgacuc 
uaccaacgag 
ugaccucgag 
cgcguauugc 
agcugucguc 
ugccuccaga 
gauccaaggc 
agcuucgugg 
cauucaguac 
ggcauucagc 
ucuggggggc 
caguggccug 
ccuggcaggg 
cgagaagccc 
ucugguggug 
cgcgguccaa 
uacucacuac 
cacuauaacu 
augcgccggc 
uaagaacugg 
ccaaaagggg 
cggcgccaac 
cugcaugaac 
gccgaaaccc 
ggaagugacg 
agucccuugc 
uagguccgcc 
caauucauuu 
gauguccaug 
gcggggguca 
gcgagccacc 
guucaugggg 
ccucgacuca 
gcuccccagg 
accgcuugug 
ucuccccccc 
gagcgagagc 
ccccccaagc 
gacacccccu 
cgagggggag 
ccaggggggg 
ggaugacucc 
uuguagcccc 
ccauaacaag 



2760 
2820 
2880 
294 0 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
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guauacugua 
augcaagugc 
aaggucagcg 
gcaagaucca 
aaccacauca 
accaucaugg 
gcucgccuua 
gaugucacac 
cccgcucagc 
uuuucguaug 
gaguccauau 
acugagagac 
aggcguugcc 
guaaaagccc 
ggcgacgacu 
agagccuuca 
gaauaugacc 
cagggccgcc 
ugggaaacag 
ccaaccauau 
gacacccuag 
cuggaccucc 
uacacucccc 
cucagagcgu 
agggcggccg 
acuccuuugc 
gggggcgaca 
cuccuacuuu 
uagcuacacu 
uuuuuuuuuu 
uuucuuggug 
augacugcag 



cuacaucaaa 
ucgacgccua 
caaggcuccu 
aguauggguu 
aguccgugug 
ccaaaaauga 
ucguuuaccc 
aaaagcuucc 
ggguggaguu 
auacccgaug 
accaggccug 
ucuauguggg 
gcgccagcgg 
uagcggcuug 
uggucgucau 
cggaggcuau 
uggagcuaau 
gcagauacua 
uuagacacuc 
ggguucgcau 
accagaaccu 
cagccauaau 
acgaacugac 
ggaagagucg 
uuugcggucg 
cggaggcacg 
uuuaucacag 
cuguaggggu 
ccauagcuaa 
cuuuuuuuuu 
gcuccaucuu 
agagugccgu 



gagugccuca 
uuaugauuca 
caccuuagag 
uggggcuaag 
gaaggaccuc 
gguguucugc 
ugaccucggc 
ucaggcggug 
ucucuugaag 
cuuugacuca 
cuccuuaccc 
agggcccaug 
ggugcuuacc 
caaggcugcg 
cucagaaagc 
gaccagguau 
aacaucuugu 
ccugaccaga 
cccugucaau 
gguccugaug 
uaacuuugaa 
ugaaagguua 
gcggguggcu 
ggcgcgugca 
guaccucuuc 
ccuccuggau 
cgugucgcgu 
aggccucuuc 
cuguuccuuu 
uuuuucccuc 
agcccuaguc 
aacuggucuc 



cuaagggcua 
gucuuaaagg 
gaggcgugcc 
gagguccgca 
uuggaagacu 
guggaccccg 
gucagggucu 
augggggcuu 
gcaugggcgg 
accgucacug 
gaggaggccc 
uucaacagca 
acuaguaugg 
gggauaauug 
caggggacug 
ucugccccuc 
uccucaaacg 
gaccccacca 
ucauggcugg 
acacacuucu 
auguacggau 
cacgggcuug 
ucagcccuca 
guuagggcgu 
aacugggcgg 
uuguccaguu 
gcccgacccc 
cuacuccccg 
uuuuuuuuuu 
uuucuucccu 
acggcuagcu 
ucugcagauc 



aaaagguaac 
acaucaagcu 
aauugacccc 
gcuuguccgg 
cacaaacacc 
ccaagggggg 
gcgagaagau 
cuuauggcuu 
aaaagagaga 
agagagacau 
gaacugccau 
agggccaguc 
ggaacaccau 
cgcccacgau 
aggaggacga 
cuggugaccc 
ugucuguggc 
cuucaauugc 
gaaacaucau 
ucuccauucu 
cgguguacuc 
acgccuucuc 
gaaaacuugg 
cccucaucuc 
ugaagaccaa 
gguuuaccgu 
gccuauuacu 
cucgauagag 
uuuuuuuuuu 
ucucaucuua 
gugaaagguc 
augu 



uuuugauagg 
agcggccucc 
accccacucu 
gagggccguc 
aauuccuaca 
uaaaaaacca 
ggcccuuuau 
ccaguacucc 
cccuaugggu 
caggacugag 
acacucgcug 
cugcggguac 
cacaugcuau 
gcugguaugc 
gcggaaccug 
ccccagaccg 
acuuggccca 
ccgggcugcc 
ccaguacgcu 
cauggcccag 
cgugaguccu 
ucugcacaca 
ggcgccaccc 
ccgugggggg 
gcucaaacuc 
cggcgccggc 
ccuuagccua 
cggcacacau 
uuuuuuuuuu 
uucuacuuuc 
cgugagccgc 



6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8024 



<210> 3 

<211> 9678 

<212> DNA 

<213> Hepatitis C virus 

<220> 
<221> CDS 

<222> (341) . . (9442) 
<400> 3 

acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 

cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120 

ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 

aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 24 0 

caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 



cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg age aca aat cct 355 

Met Ser Thr Asn Pro 
1 5 



aaa cct caa 
Lys Pro Gin 



gtt aag ttc 
Val Lys Phe 



ccg cgc agg 
Pro Arg Arg 
40 

gag egg tec 
Glu Arg Ser 
55 

cgc tec act 
Arg Ser Thr 
70 

tat ggg aat 
Tyr Gly Asn 



ggc tct cgc 
Gly Ser Arg 



aac gtg ggt 
Asn Val Gly 
120 

atg ggg tac 
Met Gly Tyr 
135 

get gtc gcg 
Ala Val Ala 
150 

aca ggg aac 
Thr Gly Asn 



ttg tec tgc 
Leu Ser Cys 



agt age age 
Ser Ser Ser 
200 

tgg cag etc 
Trp Gin Leu 
215 



aga aaa acc 
Arg Lys Thr 
10 

ccg ggc ggc 
Pro Gly Gly 
25 

ggc ccc agg 
Gly Pro Arg 



cag cca cgt 
Gin Pro Arg 



ggc aag gee 
Gly Lys Ala 
75 

gag gga etc 
Glu Gly Leu 
90 

ccc tec tgg 
Pro Ser Trp 
105 

aaa gtc ate 
Lys Val He 



ate ccc gtc 
He Pro Val 



cac ggc gtg 
His Gly Val 
155 

eta ccc ggt 
Leu Pro Gly 
170 

ate acc gtt 
He Thr Val 
185 

tac atg gtg 
Tyr Met Val 



gag get gcg 
Glu Ala Ala 



aaa aga aac 
Lys Arg Asn 



ggc cag ate 
Gly Gin He 

30 

ttg ggt gtg 
Leu Gly Val 
45 

ggg aga cgc 
Gly Arg Arg 
60 

tgg gga aaa 
Trp Gly Lys 



ggc tgg gca 
Gly Trp Ala 



ggc ccc act 
Gly Pro Thr 
110 

gac acc eta 
Asp Thr Leu 
125 

gta ggc gee 
Val Gly Ala 
140 

aga gtc ctg 
Arg Val Leu 



ttc ccc ttt 
Phe Pro Phe 



ccg gtc tct 
Pro Val Ser 
190 

acc aat gac 
Thr Asn Asp 
205 

gtt etc cac 
Val Leu His 
220 



acc aac cgt 
Thr Asn Arg 
15 

gtt ggc gga 
Val Gly Gly 



cgc acg aca 
Arg Thr Thr 



cag ccc ate 
Gin Pro He 
65 

cca ggt cgc 
Pro Gly Arg 
80 

gga tgg etc 
Gly Trp Leu 
95 

gac ccc egg 
Asp Pro Arg 



acg tgt ggc 
Thr Cys Gly 



ccg ctt agt 
Pro Leu Ser 
145 

gag gac ggg 
Glu Asp Gly 
160 

tct ate ttc 
Ser He Phe 
175 

get gee cag 
Ala Ala Gin 



tgc tec aat 
Cys Ser Asn 



gtc ccc ggg 
Val Pro Gly 
225 



cgc cca gaa 
Arg Pro Glu 
20 

gta tac ttg 
Val Tyr Leu 
35 

agg aaa act 
Arg Lys Thr 
50 

ccc aaa gat 
Pro Lys Asp 



ccc tgg ccc 
Pro Trp Pro 



ctg tec ccc 
Leu Ser Pro 
100 

cat agg teg 
His Arg Ser 
115 

ttt gee gac 
Phe Ala Asp 
130 

ggc gee gee 
Gly Ala Ala 



gtt aat tat 
Val Asn Tyr 



ttg ctg gee 
Leu Leu Ala 
180 

gtg aag aat 
Val Lys Asn 
195 

gac age ate 
Asp Ser He 
210 

tgc gtc ccg 
Cys Val Pro 



gac 403 
Asp 



ttg 451 
Leu 



teg 499 
Ser 



egg 547 
Arg 



eta 595 
Leu 
85 

cga 643 
Arg 



cgc 691 
Arg 



etc 739 
Leu 



aga 787 
Arg 



gca 835 

Ala 

165 

ctg 883 
Leu 



acc 931 
Thr 



act 979 
Thr 



tgc 1027 
Cys 
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gag aga gtg 
Glu Arg Val 
230 

atg get gtg 
Met Ala Val 



ate gat atg 
lie Asp Met 



ggg gac etc 
Gly Asp Leu 
280 

teg ccg cag 
Ser Pro Gin 
295 

cct ggc ace 
Pro Gly Thr 
310 

tgg teg ccc 
Trp Ser Pro 



gag gtc ate 
Glu Val He 



ggc ttg gee 
Gly Leu Ala 
360 

ate ctt ctg 
He Leu Leu 
375 

ggc get gtt 
Gly Ala Val 
390 

ggc cct cag 
Gly Pro Gin 



ate aac cgt 
He Asn Arg 



etc gcg gee 
Leu Ala Ala 
440 

ggg cgc ctg 



ggg aat acg 
Gly Asn Thr 
235 

egg cag ccc 
Arg Gin Pro 
250 

gtt gtg atg 
Val Val Met 
265 

tgt ggc ggg 
Cys Gly Gly 



tac cac tgg 
Tyr His Trp 



ate act gga 
He Thr Gly 
315 

acg gee ace 
Thr Ala Thr 
330 

ata gac ate 
He Asp He 
345 

tac ttc tct 
Tyr Phe Ser 



ctg gec get 
Leu Ala Ala 



gca cgt tec 
Ala Arg Ser 
395 

cag aac att 
Gin Asn He 
410 

act gee ttg 
Thr Ala Leu 
425 

ttg ttc tac 
Leu Phe Tyr 



tec gee tgc 



tea egg tgt 
Ser Arg Cys 



ggt gee etc 
Gly Ala Leu 



tec gee ace 
Ser Ala Thr 
270 

gtg atg etc 
Val Met Leu 
285 

ttt gtg caa 
Phe Val Gin 
300 

cac cgc atg 
His Arg Met 



atg ate ctg 
Met He Leu 



gtt age ggg 
Val Ser Gly 
350 

atg cag gga 
Met Gin Gly 
365 

ggg gtg gac 
Gly Val Asp 
380 

ace aac gtg 
Thr Asn Val 



cag etc att 
Gin Leu He 



aat tgc aat 
Asn Cys Asn 
430 

ace aac cgc 
Thr Asn Arg 
445 

cgc aac ate 



tgg gtg cca 
Trp Val Pro 
240 

acg cag ggt 
Thr Gin Gly 
255 

ttc tgc tct 
Phe Cys Ser 



gcg gee cag 
Ala Ala Gin 



gaa tgc aat 
Glu Cys Asn 
305 

gca tgg gac 
Ala Trp Asp 
320 

gcg tac gtg 
Ala Tyr Val 
335 

get cac tgg 
Ala His Trp 



gcg tgg gcg 
Ala Trp Ala 



gcg ggc ace 
Ala Gly Thr 
385 

att gee ggc 
He Ala Gly 
400 

aac ace aac 
Asn Thr Asn 
415 

gac tec ttg 
Asp Ser Leu 



ttt aac teg 
Phe Asn Ser 



gag get ttc 



gtc teg cca 
Val Ser Pro 



ctg egg acg 
Leu Arg Thr 
260 

get etc tac 
Ala Leu Tyr 
275 

gtg ttc ate 
Val Phe He 
290 

tgc tec ate 
Cys Ser He 



atg atg atg 
Met Met Met 



atg cgc gtc 
Met Arg Val 
340 

ggc gtc atg 
Gly Val Met 
355 

aag gtc att 
Lys Val He 
370 

ace ace gtt 
Thr Thr Val 



gtg ttc age 
Val Phe Ser 



ggc agt tgg 
Gly Ser Trp 
420 

aac ace ggc 
Asn Thr Gly 
435 

tea ggg tgt 
Ser Gly Cys 
450 

egg ata ggg 



aac 1075 

Asn 

245 

cac 1123 
His 



gtg 1171 
Val 



gtc 1219 
Val 



tac 1267 
Tyr 



aac 1315 

Asn 

325 

ccc 1363 
Pro 



ttc 1411 
Phe 



gtc 1459 
Val 



gga 1507 
Gly 



cat 1555 

His 

405 

cac 1603 
His 



ttt 1651 
Phe 



cca 1699 
Pro 



tgg 1747 
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Gly Arg Leu 
455 

ggc acc eta 
Gly Thr Leu 
470 

ccg tac tgc 
Pro Tyr Cys 



agg tct gtg 
Arg Ser Val 



gtg ggc acg 
Val Gly Thr 
520 

aat gag aca 
Asn Glu Thr 
535 

tea tgg ttc 
Ser Trp Phe 
550 

tgt ggc gcg 
Cys Gly Ala 



gac ttg ttg 
Asp Leu Leu 



tat att aag 
Tyr lie Lys 
600 

cac tac cct 
His Tyr Pro 
615 

ate ttc aag 
lie Phe Lys 
630 

gec gca tgc 
Ala Ala Cys 



gac agg agt 
Asp Arg Ser 



ate ctg ccc 
lie Leu Pro 



Ser Ala Cys 



cag tac gag 
Gin Tyr Glu 
475 

tgg cac tac 
Trp His Tyr 
490 

tgt ggc cca 
Cys Gly Pro 
505 

acc gac aga 
Thr Asp Arg 



gat gtc ttc 
Asp Val Phe 



ggc tgc acg 
Gly Cys Thr 
555 

cca cct tgc 
Pro Pro Cys 
570 

tgc cct acg 
Cys Pro Thr 
585 

tgt ggt tct 
Cys Gly Ser 



tac aga etc 
Tyr Arg Leu 



ata aga atg 
lie Arg Met 

635 

aac ttc act 
Asn Phe Thr 
650 

cag ctg tct 
Gin Leu Ser 
665 

tgc acc tac 
Cys Thr Tyr 



Arg Asn lie 
460 

gat aat gtc 
Asp Asn Val 



ccc cca aag 
Pro Pro Lys 



gtg tac tgt 
Val Tyr Cys 
510 

cgt gga gtg 
Arg Gly Val 
525 

eta ctg aac 
Leu Leu Asn 
540 

tgg atg aac 
Trp Met Asn 



cgc acc aga 
Arg Thr Arg 



gat tgt ttt 
Asp Cys Phe 
5 90 

ggg ccc tgg 
Gly Pro Trp 
605 

tgg cat tac 
Trp His Tyr 
620 

tat gta ggg 
Tyr Val Gly 



cgt ggg gat 
Arg Gly Asp 



cct ctg ttg 
Pro Leu Leu 
670 

tea gac tta 
Ser Asp Leu 



Glu Ala Phe 
465 

acc aat cca 
Thr Asn Pro 
480 

ccg tgt ggc 
Pro Cys Gly 
495 

ttc acc ccc 
Phe Thr Pro 



ccc acc tac 
Pro Thr Tyr 



age acc cga 
Ser Thr Arg 
545 

tec act ggt 
Ser Thr Gly 
560 

get gac ttc 
Ala Asp Phe 
575 

agg aag cat 
Arg Lys His 



etc aca cca 
Leu Thr Pro 



ccc tgc aca 
Pro Cys Thr 
625 

ggg gtt gag 
Gly Val Glu 
640 

cgc tgc gac 
Arg Cys Asp 
655 

cac tct acc 
His Ser Thr 



ccc get ttg 
Pro Ala Leu 



Arg He Gly 



gag gat atg 
Glu Asp Met 



gta gtc ccc 
Val Val Pro 
500 

age ccg gta 
Ser Pro Val 
515 

aca tgg gga 
Thr Trp Gly 
530 

ccg ccg cag 
Pro Pro Gin 



ttc acc aag 
Phe Thr Lys 



aac gee age 
Asn Ala Ser 
580 

cct gat gee 
Pro Asp Ala 
595 

aag tgc ctg 
Lys Cys Leu 
610 

gtc aat ttt 
Val Asn Phe 



cac agg etc 
His Arg Leu 



ttg gag gac 
Leu Glu Asp 
660 

acg gaa tgg 
Thr Glu Trp 
675 

tea act ggt 
Ser Thr Gly 



Trp 



agg 1795 

Arg 

485 

gcg 1843 
Ala 



gta 1891 
Val 



gag 1939 
Glu 



ggc 1987 
Gly 



act 2035 

Thr 

565 

acg 2083 
Thr 



act 2131 
Thr 



gtc 2179 
Val 



acc 2227 
Thr 



acg 2275 

Thr 

645 

agg 2323 
Arg 



gee 2371 
Ala 



ctt 2419 
Leu 
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680 685 690 

etc cac ctt cac cag aac ate gtg gac gta caa tac atg tat ggc etc 2467 
Leu His Leu His Gin Asn He Val Asp Val Gin Tyr Met Tyr Gly Leu 
695 700 705 

tea cct get ate aca aaa tac gtc gtt cga tgg gag tgg gtg gta etc 2515 
Ser Pro Ala He Thr Lys Tyr Val Val Arg Trp Glu Trp Val Val Leu 
710 715 720 725 

tta ttc ctg etc tta gcg gac gec aga gtc tgc gec tgc ttg tgg atg 2563 
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met 
730 735 740 

etc ate ttg ttg ggc cag gee gaa gca gca ttg gag aag ttg gtc gtc 2611 
Leu He Leu Leu Gly Gin Ala Glu Ala Ala Leu Glu Lys Leu Val Val 
745 "* 750 755 

ttg cac get gcg agt gcg get aac tgc cat ggc etc eta tat ttt gec 2659 
Leu His Ala Ala Ser Ala Ala Asn Cys His Gly Leu Leu Tyr Phe Ala 
760 765 770 

ate ttc ttc gtg gca get tgg cac ate agg ggt egg gtg gtc ccc ttg 2707 
He Phe Phe Val Ala Ala Trp His He Arg Gly Arg Val Val Pro Leu 
775 780 785 

acc acc tat tgc etc act ggc eta tgg ccc ttc tgc eta ctg etc atg 2755 
Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe Cys Leu Leu Leu Met 
790 795 800 805 

gca ctg ccc egg cag get tat gec tat gac gca cct gtg cac gga cag 2803 
Ala Leu Pro Arg Gin Ala Tyr Ala Tyr Asp Ala Pro Val His Gly Gin 
810 815 820 

ata ggc gtg ggt ttg ttg ata ttg ate acc etc ttc aca etc acc ccg 2851 
He Gly Val Gly Leu Leu He Leu He Thr Leu Phe Thr Leu Thr Pro 
825 830 835 

ggg tat aag acc etc etc ggc cag tgt ctg tgg tgg ttg tgc tat etc 2899 
Gly Tyr Lys Thr Leu Leu Gly Gin Cys Leu Trp Trp Leu Cys Tyr Leu 
840 845 850 

ctg acc ctg ggg gaa gee atg att cag gag tgg gta cca ccc atg cag 2947 
Leu Thr Leu Gly Glu Ala Met He Gin Glu Trp Val Pro Pro Met Gin 
855 860 865 

gtg cgc ggc ggc cgc gat ggc ate gcg tgg gee gtc act ata ttc tgc 2995 
Val Arg Gly Gly Arg Asp Gly He Ala Trp Ala Val Thr He Phe Cys 
870 875 880 885 

ccg ggt gtg gtg ttt gac att acc aaa tgg ctt ttg gcg ttg ctt ggg 3043 
Pro Gly Val Val Phe Asp He Thr Lys Trp Leu Leu Ala Leu Leu Gly 
890 895 900 

cct get tac etc tta agg gec get ttg aca cat gtg ccg tac ttc gtc 3091 
Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His Val Pro Tyr Phe Val 
905 910 915 
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aga get cac get ctg ata agg gta tgc get ttg gtg aag cag etc gcg 3139 
Arg Ala His Ala Leu lie Arg Val Cys Ala Leu Val Lys Gin Leu Ala 
920 925 930 

ggg ggt agg tat gtt cag gtg gcg eta ttg gee ctt ggc agg tgg act 3187 
Gly Gly Arg Tyr Val Gin Val Ala Leu Leu Ala Leu Gly Arg Trp Thr 
935 940 945 

ggc acc tac ate tat gac cac etc aca cct atg teg gac tgg gec get 3235 
Gly Thr Tyr lie Tyr Asp His Leu Thr Pro Met Ser Asp Trp Ala Ala 
950 955 960 965 

age ggc ctg cgc gac tta gcg gtc gee gtg gaa ccc ate ate ttc agt 3283 
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro lie lie Phe Ser 
970 97 5 980 

ccg atg gag aag aag gtc ate gtc tgg gga gcg gag acg get gca tgt 3331 
Pro Met Glu Lys Lys Val He Val Trp Gly Ala Glu Thr Ala Ala Cys 
985 990 995 

ggg gac att eta cat gga ctt ccc gtg tec gec cga etc ggc cag gag 3379 
Gly Asp He Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Gin Glu 
1000 1005 1010 

ate etc etc ggc cca get gat ggc tac acc tec aag ggg tgg aag etc 3427 
He Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu 
1015 1020 1025 

ctt get ccc ate act get tat gee cag caa aca cga ggc etc ctg ggc 3475 
Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
1030 1035 1040 1045 

gec ata gtg gtg agt atg acg ggg cgt gac agg aca gaa cag gec ggg 3523 
Ala He Val Val Ser Met Thr Gly Arg Asp Arg Thr Glu Gin Ala Gly 
1050 1055 1060 

gaa gtc caa ate ctg tec aca gtc tct cag tec ttc etc gga aca acc 3571 
Glu Val Gin He Leu Ser Thr Val Ser Gin Ser Phe Leu Gly Thr Thr 
1065 1070 1075 

ate teg ggg gtt ttg tgg act gtt tac cac gga get ggc aac aag act 3619 
He Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr 
1080 1085 1090 

eta gec ggc tta egg ggt ccg gtc acg cag atg tac teg agt get gag 3667 
Leu Ala Gly Leu Arg Gly Pro Val Thr Gin Met Tyr Ser Ser Ala Glu 
1095 1100 1105 

ggg gac ttg gta ggc tgg ccc age ccc cct ggg acc aag tct ttg gag 3715 
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu 
1110 1115 1120 1125 

ccg tgc aag tgt gga gec gtc gac eta tat ctg gtc acg egg aac get 3763 
Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala 
1130 1135 1140 
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gat gtc ate ccg get egg aga cgc ggg gac aag egg gga gca ttg etc 3811 
Asp Val lie Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu 
1145 1150 1155 

tec ccg aga ccc att teg ace ttg aag ggg tec teg ggg ggg ccg gtg 3859 
Ser Pro Arg Pro He Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val 
1160 1165 1170 

etc tgc cct agg ggc cac gtc gtt ggg etc ttc cga gca get gtg tgc 3907 
Leu Cys Pro Arg Gly His Val Val Gly Leu Phe Arg Ala Ala Val Cys 
1175 1180 1185 

tct egg ggc gtg gee aaa tec ate gat ttc ate ccc gtt gag aca etc 3955 
Ser Arg Gly Val Ala Lys Ser He Asp Phe He Pro Val Glu Thr Leu 
1190 " 1195 1200 1205 

gac gtt gtt aca agg tct ccc act ttc agt gac aac age acg cca ccg 4003 
Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro 
1210 1215 1220 

get gtg ccc cag acc tat cag gtc ggg tac ttg cat get cca act ggc 4051 
Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu His Ala Pro Thr Gly 
1225 1230 1235 

agt gga aag age acc aag gtc cct gtc gcg tat gee gee cag ggg tac 4099 
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gin Gly Tyr 
1240 1245 1250 

aaa gta eta gtg ctt aac ccc teg gta get gee acc ctg ggg ttt ggg 4147 
Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
1255 1260 1265 

gcg tac eta tec aag gca cat ggc ate aat ccc aac att agg act gga 4195 
Ala Tyr Leu Ser Lys Ala His Gly He Asn Pro Asn He Arg Thr Gly 
1270 " 1275 1280 1285 

gtc agg acc gtg atg acc ggg gag gee ate acg tac tec aca tat ggc 4243 
Val Arg Thr Val Met Thr Gly Glu Ala He Thr Tyr Ser Thr Tyr Gly 
1290 1295 1300 

aaa ttt etc gee gat ggg ggc tgc get age ggc gee tat gac ate ate 4291 
Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly Ala Tyr Asp He He 
1305 1310 1315 

ata tgc gat gaa tgc cac get gtg gat get acc tec att etc ggc ate 4339 
He Cys Asp Glu Cys His Ala Val Asp Ala Thr Ser He Leu Gly He 
1320 ' 1325 1330 

gga acg gtc ctt gat caa gca gag aca gee ggg gtc aga eta act gtg 4387 
Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Val Arg Leu Thr Val 
1335 1340 1345 

ctg get acg gee aca ccc ccc ggg tea gtg aca acc ccc cat ccc gat 4435 
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asp 
1350 1355 1360 1365 

ata gaa gag gta ggc etc ggg egg gag ggt gag ate ccc ttc tat ggg 4 4 83 
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He Glu Glu Val Gly Leu Gly Arg Glu Gly Glu He Pro Phe Tyr Gly 
1370 1375 1380 



agg gcg att ccc eta tec tgc ate aag gga ggg aga cac ctg att ttc 4531 
Arg Ala He Pro Leu Ser Cys He Lys Gly Gly Arg His Leu He Phe 
1385 1390 1395 

tgc cac tea aag aaa aag tgt gac gag etc gcg gcg gee ctt egg ggc 4579 
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Ala Leu Arg Gly 
1400 1405 1410 

atg ggc ttg aat gec gtg gca tac tat aga ggg ttg gac gtc tec ata 4627 
Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser He 
1415 1420 1425 

ata cca get cag gga gat gtg gtg gtc gtc gec acc gac gec etc atg 4 675 
lie Pro Ala Gin Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met 
1430 1435 1440 1445 

acg ggg tac act gga gac ttt gac tec gtg ate gac tgc aat gta gcg 4723 
Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Val Ala 
1450 1455 1460 

gtc acc caa get gtc gac ttc age ctg gac ccc acc ttc act ata acc 4771 
Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Thr 
1465 1470 1475 

aca cag act gtc cca caa gac get gtc tea cgc agt cag cgc cgc ggg 4819 
Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
1480 1485 1490 

cgc aca ggt aga gga aga cag ggc act tat agg tat gtt tec act ggt 4867 
Arg Thr Gly Arg Gly Arg Gin Gly Thr Tyr Arg Tyr Val Ser Thr Gly 
1495 1500 1505 

gaa cga gec tea gga atg ttt gac agt gta gtg ctt tgt gag tgc tac 4915 
Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr 
1510 ~ 1515 1520 1525 

gac gca ggg get gcg tgg tac gat etc aca cca gcg gag acc acc gtc 4 963 
Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr Thr Val 
1530 1535 1540 

agg ctt aga gcg tat ttc aac acg ccc ggc eta ccc gtg tgt caa gac 5011 
Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
1545 1550 1555 

cat ctt gaa ttt tgg gag gca gtt ttc acc ggc etc aca cac ata gac 5059 
His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His He Asp 
1560 1565 1570 

gec cac ttc etc tec caa aca aag caa gcg ggg gag aac ttc gcg tac 5107 
Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Glu Asn Phe Ala Tyr 
1575 1580 1585 

eta gta gee tac caa get acg gtg tgc gee aga gec aag gee cct ccc 5155 
Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro 
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1590 



1595 



1600 



1605 



ccg tec tgg gac gec atg tgg aag tgc ctg gec cga etc aag cct acg 5203 
Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala Arg Leu Lys Pro Thr 
1610 1615 1620 

ctt gcg ggc ccc aca cct etc ctg tac cgt ttg ggc cct att acc aat 5251 
Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro lie Thr Asn 
1625 1630 1635 

gag gtc acc etc aca cac cct ggg acg aag tac ate gec aca tgc atg 5299 
Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr lie Ala Thr Cys Met 
1640 1645 1650 

caa get gac ctt gag gtc atg acc age acg tgg gtc eta get gga gga 5347 
Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly 
1655 1660 1665 

gtc ctg gca gee gtc gee gca tat tgc ctg gcg act gga tgc gtt tec 5395 
Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser 
1670 1675 1680 1685 

ate ate ggc cgc ttg cac gtc aac cag cga gtc gtc gtt gcg ccg gat 5443 
He He Gly Arg Leu His Val Asn Gin Arg Val Val Val Ala Pro Asp 
1690 1695 1700 

aag gag gtc ctg tat gag get ttt gat gag atg gag gaa tgc gee tct 5491 
Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser 
1705 1710 1715 

agg gcg get etc ate gaa gag ggg cag egg ata gee gag atg ttg aag 5539 
Arg Ala Ala Leu He Glu Glu Gly Gin Arg He Ala Glu Met Leu Lys 
1720 1725 1730 

tec aag ate caa ggc ttg ctg cag cag gee tct aag cag gee cag gac 5587 
Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser Lys Gin Ala Gin Asp 
1735 1740 1745 

ata caa ccc get atg cag get tea tgg ccc aaa gtg gaa caa ttt tgg 5635 
He Gin Pro Ala Met Gin Ala Ser Trp Pro Lys Val Glu Gin Phe Trp 
1750 1755 1760 1765 

gee aga cac atg tgg aac ttc att age ggc ate caa tac etc gca gga 5683 
Ala Arg His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly 
1770 1775 1780 

ttg tea aca ctg cca ggg aac ccc gcg gtg get tec atg atg gca ttc 5731 
Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe 
1785 1790 1795 

agt gec gec etc acc agt ccg ttg teg acc agt acc acc ate ctt etc 5779 
Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr He Leu Leu 
1800 1805 1810 

aac ate atg gga ggc tgg tta gcg tec cag ate gca cca ccc gcg ggg 5827 
Asn He Met Gly Gly Trp Leu Ala Ser Gin He Ala Pro Pro Ala Gly 
1815 ~ 1820 1825 
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gcc acc ggc ttt gtc gtc agt ggc ctg gtg ggg get gec gtg ggc age 
Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser 
1830 1835 1840 1845 



5875 



ata ggc ctg ggt aag gtg ctg gtg gac ate ctg gca gga tat ggt gcg 5923 
He Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
1850 1855 1860 

ggc att teg ggg gcc etc gtc gca ttc aag ate atg tct ggc gag aag 5971 
Gly He Ser Gly Ala Leu Val Ala Phe Lys He Met Ser Gly Glu Lys 
1865 1870 1875 

ccc tct atg gaa gat gtc ate aat eta ctg cct ggg ate ctg tct ccg 6019 
Pro Ser Met Glu Asp Val He Asn Leu Leu Pro Gly He Leu Ser Pro 
1880 1885 1890 

gga gcc ctg gtg gtg ggg gtc ate tgc gcg gcc att ctg cgc cgc cac 6067 
Gly Ala Leu Val Val Gly Val He Cys Ala Ala He Leu Arg Arg His 
1895 1900 1905 

gtg gga ccg ggg gag ggc gcg gtc caa tgg atg aac agg ctt att gcc 6115 
Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
1910 ~ 1915 1920 1925 

ttt get tec aga gga aac cac gtc gcc cct act cac tac gtg acg gag 6163 
Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu 
1930 1935 1940 

teg gat gcg teg cag cgt gtg acc caa eta ctt ggc tct ctt act ata 6211 
Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu Gly Ser Leu Thr He 
1945 1950 1955 

acc age eta etc aga aga etc cac aat tgg ata act gag gac tgc ccc 6259 
Thr Ser Leu Leu Arg Arg Leu His Asn Trp He Thr Glu Asp Cys Pro 
1960 1965 ~ 1970 

ate cca tgc tec gga tec tgg etc cgc gac gtg tgg gac tgg gtt tgc 6307 
He Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys 

1975 1980 1985 

acc ate ttg aca gac ttc aaa aat tgg ctg acc tct aaa ttg ttc ccc 6355 
Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro 
1990 1995 2000 2005 

aag ctg ccc ggc etc ccc ttc ate tct tgt caa aag ggg tac aag ggt 6403 
Lys Leu Pro Gly Leu Pro Phe He Ser Cys Gin Lys Gly Tyr Lys Gly 
2010 2015 2020 

gtg tgg gcc ggc act ggc ate atg acc acg cgc tgc cct tgc ggc gcc 6451 
Val Trp Ala Gly Thr Gly He Met Thr Thr Arg Cys Pro Cys Gly Ala 
2025 2030 2035 

aac ate tct ggc aat gtc cgc ctg ggc tct atg agg ate aca ggg cct 64 99 
Asn He Ser Gly Asn Val Arg Leu Gly Ser Met Arg He Thr Gly Pro 
2040 2045 2050 
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aaa acc tgc atg aac acc tgg cag ggg acc ttt cct ate aat tgc tac 6547 
Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe Pro He Asn Cys Tyr 
2055 2060 2065 

acg gag ggc cag tgc gcg ccg aaa ccc ccc acg aac tac aag acc gec 6595 
Thr Glu Gly Gin Cys Ala Pro Lys Pro Pro Thr Asn Tyr Lys Thr Ala 
2070 2075 2080 2085 

ate tgg agg gtg gcg gec teg gag tac gcg gag gtg acg cag cat ggg 6643 
He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gin His Gly 
2090 2095 2100 

teg tac tec tat gta aca gga ctg acc act gac aat ctg aaa att cct 6691 
Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp Asn Leu Lys He Pro 
2105 2110 2115 

tgc caa eta cct tct cca gag ttt ttc tec tgg gtg gac ggt gtg cag 6739 
Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gin 
2120 2125 * 2130 

ate cat agg ttt gca ccc aca cca aag ccg ttt ttc egg gat gag gtc 6787 
He His Arg Phe Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val 
2135 ' 2140 2145 

teg ttc tgc gtt ggg ctt aat tec tat get gtc ggg tec cag ctt ccc 6835 
Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val Gly Ser Gin Leu Pro 
2150 2155 2160 2165 

tgt gaa cct gag ccc gac gca gac gta ttg agg tec atg eta aca gat 6883 
Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg Ser Met Leu Thr Asp 
2170 2175 2180 

ccg ccc cac ate acg gcg gag act gcg gcg egg cgc ttg gca egg gga 6931 
Pro Pro His He Thr Ala Glu Thr Ala Ala Arg Arg Leu Ala Arg Gly 
2185 2190 2195 

tea cct cca tct gag gcg age tec tea gtg age cag eta tea gca ccg 6979 
Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser Gin Leu Ser Ala Pro 
2200 2205 2210 

teg ctg egg gec acc tgc acc acc cac age aac acc tat gac gtg gac 7027 
Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn Thr Tyr Asp Val Asp 
2215 2220 2225 

atg gtc gat gec aac ctg etc atg gag ggc ggt gtg get cag aca gag 7075 
Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly Val Ala Gin Thr Glu 
2230 2235 2240 2245 

cct gag tec agg gtg ccc gtt ctg gac ttt etc gag cca atg gee gag 7123 
Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu Glu Pro Met Ala Glu 
2250 2255 . 2260 

gaa gag age gac ctt gag ccc tea ata cca teg gag tgc atg etc ccc 7171 
Glu Glu Ser Asp Leu Glu Pro Ser He Pro Ser Glu Cys Met Leu Pro 
2265 2270 2275 
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agg age ggg ttt cca egg gec tta ccg get tgg gca egg cct gac tac 7219 
Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr 
2280 2285 2290 

aac ccg ccg etc gtg gaa teg tgg agg agg cca gat tac caa ccg ccc 7267 
Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro Asp Tyr Gin Pro Pro 
2295 2300 ~ 2305 

ace gtt get ggt tgt get etc ccc ccc ccc aag aag gee ccg acg cct 7315 
Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Ala Pro Thr Pro 
2310 2315 2320 2325 

ccc cca agg aga cgc egg aca gtg ggt ctg age gag age acc ata tea 7363 
Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr lie Ser 
2330 2335 2340 

gaa gee etc cag caa ctg gee ate aag acc ttt ggc cag ccc ccc teg 7411 
Glu Ala Leu Gin Gin Leu Ala lie Lys Thr Phe Gly Gin Pro Pro Ser 
2345 2350 2355 

age ggt gat gca ggc teg tec acg ggg gcg ggc gee gee gaa tec ggc 7459 
Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly Ala Ala Glu Ser Gly 
2360 2365 2370 

ggt ccg acg tec cct ggt gag ccg gee ccc tea gag aca ggt tec gee 7507 
Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser Glu Thr Gly Ser Ala 
2375 2380 2385 

tec tct atg ccc ccc etc gag ggg gag cct gga gat ccg gac ctg gag 7555 
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu 
2390 2395 2400 2405 

tct gat cag gta gag ctt caa cct ccc ccc cag ggg ggg ggg gta get 7603 
Ser Asp Gin Val Glu Leu Gin Pro Pro Pro Gin Gly Gly Gly Val Ala 
2410 2415 2420 

ccc ggt teg ggc teg ggg tct tgg tct act tgc tec gag gag gac gat 7651 
Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp 
2425 2430 2435 

acc acc gtg tgc tgc tec atg tea tac tec tgg acc ggg get eta ata 7699 
Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu lie 
2440 2445 2450 

act ccc tgt age ccc gaa gag gaa aag ttg cca ate aac cct ttg agt 7747 
Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro lie Asn Pro Leu Ser 
2455 2460 2465 

aac teg ctg ttg cga tac cat aac aag gtg tac tgt aca aca tea aag 7795 
Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr Thr Ser Lys 
2470 2475 2480 2485 

age gee tea cag agg get aaa aag gta act ttt gac agg acg caa gtg 7843 
Ser Ala Ser Gin Arg Ala Lys Lys Val Thr Phe Asp Arg Thr Gin Val 
2490 2495 2500 

etc gac gee cat tat gac tea gtc tta aag gac ate aag eta gcg get 7891 
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Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp lie Lys Leu Ala Ala 
2505 * 2510 " 2515 



tec aag gtc age gca agg etc etc ace ttg gag gag gcg tgc cag ttg 7939 

Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gin Leu 

2520 2525 2530 

act cca ccc cat tct gca aga tec aag tat gga ttc ggg gee aag gag 7987 

Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu 

2535 2540 " 2545 

gtc cgc age ttg tec ggg agg gee gtt aac cac ate aag tec gtg tgg 8035 

Val Arg Ser Leu Ser Gly Arg Ala Val Asn His lie Lys Ser Val Trp 

2550 2555 2560 2565 

aag gac etc ctg gaa gac cca caa aca cca att ccc aca acc ate atg 8083 

Lys Asp Leu Leu Glu Asp Pro Gin Thr Pro lie Pro Thr Thr lie Met 

2570 2575 2580 

gec aaa aat gag gtg ttc tgc gtg gac ccc gec aag ggg ggt aag aaa 8131 

Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lys Lys 

2585 2590 2595 

cca get cgc etc ate gtt tac cct gac etc ggc gtc egg gtc tgc gag 8179 

Pro Ala Arg Leu lie Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu 

2600 2605 ~ 2610 

aaa atg gee etc tat gac att aca caa aag ctt cct cag gcg gta atg 8227 

Lys Met Ala Leu Tyr Asp lie Thr Gin Lys Leu Pro Gin Ala Val Met 

2615 " 2620 2625 

gga get tec tat ggc ttc cag tac tec cct gee caa egg gtg gag tat 8275 

Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala Gin Arg Val Glu Tyr 

2630 2635 2640 2645 

etc ttg aaa gca tgg gcg gaa aag aag gac ccc atg ggt ttt teg tat 8323 

Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro Met Gly Phe Ser Tyr 

2650 J " 2655 2660 

gat acc cga tgc ttc gac tea acc gtc act gag aga gac ate agg acc 8371 

Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp lie Arg Thr 

2665 2670 2675 

gag gag tec ata tac cag gec tgc tec ctg ccc gag gag gee cgc act 8419 

Glu Glu Ser lie Tyr Gin Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr 

2680 2685 2690 

gec ata cac teg ctg act gag aga ctt tac gta gga ggg ccc atg ttc 84 67 

Ala lie His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe 

2695 2700 2705 

aac age aag ggt caa acc tgc ggt tac aga cgt tgc cgc gee age ggg 8515 

Asn Ser Lys Gly Gin Thr Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly 

2710 2715 2720 2725 

gtg eta acc act age atg ggt aac acc ate aca tgc tat gtg aaa gee 8563 

Val Leu Thr Thr Ser Met Gly Asn Thr lie Thr Cys Tyr Val Lys Ala 



18 



2730 



2735 



2740 



eta gcg gec tgc aag get gcg ggg ata gtt gcg ccc aca atg ctg gta 8611 
Leu Ala Ala Cys Lys Ala Ala Gly He Val Ala Pro Thr Met Leu Val 
2745 2750 2755 

tgc ggc gat gac eta gta gtc ate tea gaa age cag ggg act gag gag 8659 
Cys Gly Asp Asp Leu Val Val He Ser Glu Ser Gin Gly Thr Glu Glu 
2760 2765 2770 

gac gag egg aac ctg aga gec ttc acg gag gee atg acc agg tac tct 8707 
Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser 
2775 2780 2785 

gec cct cct ggt gat ccc ccc aga ccg gaa tat gac ctg gag eta ata 8755 
Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu He 
2790 2795 2800 2805 

aca tec tgt tec tea aat gtg tct gtg gcg ttg ggc ccg egg ggc cgc 8803 
Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Arg Gly Arg 
2810 2815 2820 

cgc aga tac tac ctg acc aga gac cca acc act cca etc gec egg get 8851 
Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala 
2825 2830 2835 

gec tgg gaa aca gtt aga cac tec cct ate aat tea tgg ctg gga aac 8899 
Ala Trp Glu Thr Val Arg His Ser Pro He Asn Ser Trp Leu Gly Asn 
2840 2845 2850 

ate ate cag tat get cca acc ata tgg gtt cgc atg gtc eta atg aca 8947 
He He Gin Tyr Ala Pro Thr He Trp Val Arg Met Val Leu Met Thr 
2855 2860 2865 

cac ttc ttc tec att etc atg gtc caa gac acc ctg gac cag aac etc 8995 
His Phe Phe Ser He Leu Met Val Gin Asp Thr Leu Asp Gin Asn Leu 
2870 2875 2880 2885 

aac ttt gag atg tat gga tea gta tac tec gtg aat cct ttg gac ctt 9043 
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Asn Pro Leu Asp Leu 
2890 ' 2895 2900 

cca gee ata att gag agg tta cac ggg ctt gac gee ttt tct atg cac 9091 
Pro Ala He He Glu Arg Leu His Gly Leu Asp Ala Phe Ser Met His 
2905 2910 2915 

aca tac tct cac cac gaa ctg acg egg gtg get tea gec etc aga aaa 9139 
Thr Tyr Ser His His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys 
2920 2925 2930 

ctt ggg gcg cca ccc etc agg gtg tgg aag agt egg get cgc gca gtc 9187 
Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser Arg Ala Arg Ala Val 
2935 2940 2945 

agg gcg tec etc ate tec cgt gga ggg aaa gcg gee gtt tgc ggc cga 9235 
Arg Ala Ser Leu He Ser Arg Gly Gly Lys Ala Ala Val Cys Gly Arg 
2950 2955 2960 2965 
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tat etc ttc aat tgg gcg gtg aag acc aag etc aaa etc act cca ttg 
Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu 
2970 2975 2980 



9283 



ccg gag gcg cgc eta ctg gac tta tec agt tgg ttc acc gtc ggc gee 9331 
Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala 
2985 2990 2995 

ggc ggg ggc gac att ttt cac age gtg teg cgc gee cga ccc cgc tea 9379 
Gly Gly Gly Asp lie Phe His Ser Val Ser Arg Ala Arg Pro Arg Ser 
3000 ' 3005 3010 

tta etc ttc ggc eta etc eta ctt ttc gta ggg gta ggc etc ttc eta 9427 
Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly Val Gly Leu Phe Leu 
3015 3020 3025 

etc ccc get egg tag agcggcacac actaggtaca ctccatagct aactgttcct 9482 

Leu Pro Ala Arg 

3030 

tttttttttt tttttttttt tttttttttt tttttttttt ttcttttttt tttttttccc 9542 
tctttcttcc cttctcatct tattctactt tctttcttgg tggctccatc ttagecctag 9602 
teaeggctag ctgtgaaagg tccgtgagcc geatgactge agagagtgee gtaactggtc 9662 
tetctgeaga tcatgt 9678 



<210> 4 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 4 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

15 10 15 

Arg Arg Pro Glu Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 J 55 60 

He Pro Lys Asp Arg Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly 
65 " 70 75 80 

Arg Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 



Leu 


Leu 


Ser 


Pro 


Arg 


Gly 


Ser 


Arg 


Pro 


Ser 


Trp 


Gly 


Pro 


Thr 


Asp 


Pro 








100 










105 










110 






Arg 


His 


Arg 


Ser 


Arg 


Asn 


Val 


Gly 


Lys 


Val 


He 


Asp 


Thr 


Leu 


Thr 


Cys 






115 










120 










125 








Gly 


Phe 


Ala 


Asp 


Leu 


Met 


Gly 


Tyr 


He 


Pro 


Val 


Val 


Gly 


Ala 


Pro 


Leu 




130 










135 










140 










Ser 


Gly 


Ala 


Ala 


Arg 


Ala 


Val 


Ala 


His 


Gly 


Val 


Arg 


Val 


Leu 


Glu 


Asp 


145 










150 










155 










160 
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Gly Val 


Asn 


Tyr 


Ala 


Thr 


Gly 


Asn 


Leu 


Pro 


Gly 


Phe 


Pro 


Phe 


Ser 


He 










165 










170 










175 




Phe 


Leu 


Leu 


Ala 


Leu 


Leu 


Ser 


Cys 


lie 


Thr 


Val 


Pro 


Val 


Ser 


Ala 


Ala 








180 










185 










190 






Gin 


Val 


Lys 


Asn 


Thr 


Ser 


Ser 


Ser 


Tyr 


Met 


Val 


Thr 


Asn 


Asp 


Cys 


Ser 






195 










200 










205 








Asn 


Asp 


Ser 


He 


Thr 


Trp 


Gin 


Leu 


Glu 


Ala 


Ala 


Val 


Leu 


His 


Val 


Pro 




210 










215 










220 










Gly Cys 


Val 


Pro 


Cys 


Glu 


Arg 


Val 


Gly 


Asn 


Thr 


Ser 


Arg 


Cys 


Trp 


Val 


225 










230 










235 










240 


Pro 


Val 


Ser 


Pro 


Asn 


Met 


Ala 


Val 


Arg 


Gin 


Pro 


Gly 


Ala 


Leu 


Thr 


Gin 










245 










250 










255 




Gly 


Leu 


Arg 


Thr 


His 


lie 


Asp 


Met 


Val 


Val 


Met 


Ser 


Ala 


Thr 


Phe 


Cys 








260 










265 










270 






Ser 


Ala 


Leu 


Tyr 


Val 


Gly 


Asp 


Leu 


Cys 


Gly 


Gly 


Val 


Met 


Leu 


Ala 


Ala 






275 










280 










285 








Gin 


Val 


Phe 


He 


Val 


Ser 


Pro 


Gin 


Tyr 


His 


Trp 


Phe 


Val 


Gin 


Glu 


Cys 




290 










295 










300 










Asn 


Cys 


Ser 


He 


Tyr 


Pro 


Gly 


Thr 


He 


Thr 


Gly 


His 


Arg 


Met 


Ala 


Trp 


305 










310 










315 










320 


Asp 


Met 


Met 


Met 


Asn 


Trp 


Ser 


Pro 


Thr 


Ala 


Thr 


Met 


He 


Leu 


Ala 


Tyr 










325 










330 










335 




Val 


Met 


Arg 


Val 


Pro 


Glu 


Val 


lie 


He 


Asp 


He 


Val 


Ser 


Gly 


Ala 


His 








340 










345 










350 






Trp 


Gly 


Val 


Met 


Phe 


Gly 


Leu 


Ala 


Tyr 


Phe 


Ser 


Met 


Gin 


Gly 


Ala 


Trp 






355 










360 










365 








Ala 


Lys 


Val 


lie 


Val 


He 


Leu 


Leu 


Leu 


Ala 


Ala 


Gly 


Val 


Asp 


Ala 


Gly 




370 










375 










380 










Thr 


Thr 


Thr 


Val 


Gly 


Gly 


Ala 


Val 


Ala 


Arg 


Ser 


Thr 


Asn 


Val 


He 


Ala 


385 










390 










395 










400 


Gly Val 


Phe 


Ser 


His 


Gly 


Pro 


Gin 


Gin 


Asn 


lie 


Gin 


Leu 


He 


Asn 


Thr 










405 










410 










415 




Asn 


Gly 


Ser 


Trp 


His 


He 


Asn 


Arg 


Thr 


Ala 


Leu 


Asn 


Cys 


Asn 


Asp 


Ser 








420 










425 










430 






Leu 


Asn 


Thr 


Gly 


Phe 


Leu 


Ala 


Ala 


Leu 


Phe 


Tyr 


Thr 


Asn 


Arg 


Phe 


Asn 






435 










440 










445 








Ser 


Ser 


Gly 


Cys 


Pro 


Gly 


Arg 


Leu 


Ser 


Ala 


Cys 


Arg 


Asn 


He 


Glu 


Ala 




450 










455 










460 










Phe 


Arg 


He 


Gly 


Trp 


Gly 


Thr 


Leu 


Gin 


Tyr 


Glu 


Asp 


Asn 


Val 


Thr 


Asn 


465 










470 










475 










480 


Pro 


Glu 


Asp 


Met 


Arg 


Pro 


Tyr 


Cys 


Trp 


His 


Tyr 


Pro 


Pro 


Lys 


Pro 


Cys 










485 










490 










495 




Gly Val 


Val 


Pro 


Ala 


Arg 


Ser 


Val 


Cys 


Gly 


Pro 


Val 


Tyr 


Cys 


Phe 


Thr 








500 










505 










510 






Pro 


Ser 


Pro 


Val 


Val 


Val 


Gly 


Thr 


Thr 


Asp 


Arg 


Arg 


Gly 


Val 


Pro 


Thr 






515 










520 










525 








Tyr 


Thr 


Trp 


Gly 


Glu 


Asn 


Glu 


Thr 


Asp 


Val 


Phe 


Leu 


Leu 


Asn 


Ser 


Thr 




530 










535 










540 










Arg 


Pro 


Pro 


Gin 


Gly 


Ser 


Trp 


Phe 


Gly 


Cys 


Thr 


Trp 


Met 


Asn 


Ser 


Thr 


545 










550 










555 










560 


Gly 


Phe 


Thr 


Lys 


Thr 


Cys 


Gly 


Ala 


Pro 


Pro 


Cys 


Arg 


Thr 


Arg 


Ala 


Asp 










565 










570 










575 




Phe 


Asn 


Ala 


Ser 


Thr 


Asp 


Leu 


Leu 


Cys 


Pro 


Thr 


Asp 


Cys 


Phe 


Arg 


Lys 








580 










585 










590 






His 


Pro 


Asp 


Ala 


Thr 


Tyr 


lie 


Lys 


Cys 


Gly 


Ser 


Gly 


Pro 


Trp 


Leu 


Thr 






595 










600 










605 








Pro 


Lys 


Cys 


Leu 


Val 


His 


Tyr 


Pro 


Tyr 


Arg 


Leu 


Trp 


His 


Tyr 


Pro 


Cys 
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610 










615 




Thr 


Val 


Asn 


Phe 


Thr 


He 


Phe 


Lys 


625 










630 






Glu 


His 


Arg 


Leu 


Thr 


Ala 


Ala 


Cys 










645 








Asp 


Leu 


Glu 


Asp 


Arg 


Asp 


Arg 


Ser 








660 










Thr 


Thr 


Glu 


Trn 


Ala 


He 


Leu 


Pro 






675 










680 


Leu 


Ser 


Thr 


Gly 


Leu 


Leu 


His 


Leu 




690 










695 




Tvr 


Met 


Tvr 
y 


Glv 


Leu 


Ser 


Pro 


Ala 


705 










710 






Glu 


T rn 


Val 


Val 


Leu 


Leu 


Phe 


Leu 










725 








Ala 


Cys 


Leu 


Trn 


Met 


Leu 


He 


Leu 








7 4 0 










\J J- U. 






Val 


Val 


Leu 


His 


Ala 






755 










7 60 


Leu 


Leu 


Tvr 


Phe 


Ala 


He 


Phe 


Phe 




770 










775 




Arg 


Val 


Val 


Pro 


Leu 


Thr 


Thr 


Tvr 
y 


785 










790 






Cys 


Leu 


Leu 


Leu 


Met 


Ala 


Leu 


Pro 










805 








Pro 


Val 


His 


Gly 


Gin 


He 


Gly 


Val 








820 










Phe 


Thr 


Leu 


Thr 


Pro 


Gly 


Tyr 


Lys 






835 










840 




Leu 




Tvr 
i y r 




Leu 


Thr 


Leu 




850 










855 




Val 


Pro 


Pro 


Met 


Gin 


Val 


Arg 


Gly 


865 










870 






Val 


Thr 


He 


Phe 


Cys 
885 


Pro 


Gly 


Val 


Leu 


Ala 


Leu 


Leu 


Glv 
y 


Pro 


Ala 


Tvr 

y 








900 










Val 


Pro 


Tvr 
y 


Phe 


Val 


Arg 


Ala 


His 






915 










920 


Val 


Lys 


Gin 


Leu 


Ala 


Gly 


Gly 


Arg 




930 










935 




Leu 


Gly 


Arg 


Tro 


Thr 


Gly 


Thr 


Tvr 


94 5 










950 






Ser 


Asp 


Trn 


Ala 


Ala 


Ser 


Gly 


Leu 










965 








Pro 


He 


He 


Phe 


Ser 


Pro 


Met 


Glu 








980 










Glu 


Thr 


Ala 


Ala 


Cys 


Gly 


Asp 


He 






995 








1000 


Arg 


Leu 


Gly 


Gin 


Glu 


He 


Leu 


Leu 


1010 








1015 




Lys 


Gly 


Trp 


Lys 


Leu 


Leu 


Ala 


Pro 


1025 






1030 






Arg 


Gly 


Leu 


Leu 


Gly Ala 


He 


Val 








1045 








Thr 


Glu 


Gin 


Ala 


Gly Glu 


Val 


Gin 



1060 



620 



He 


Arg 


Met 


Tvr 
y 


Val 


Glv 


Gly Val 






635 










640 


Asn 


Phe 


Thr 


Arg 


Glv 


Asp 


Arg 


Cys 




650 










655 




Gin 


Leu 


Ser 


Pro 


Leu 


Leu 


His 


Ser 


665 










670 






Cys 


Thr 


Tvr 
1 y ± - 


Ser 


Asp 


Leu 


Pro 


Ala 










685 








His 


Gin 


Asn 


lie 


Val 


Asp 


Val 


Gin 








700 










He 


Thr 


Lys 


Tvr 
y 


Val 


Val 


Arq 


Trp 






715 










720 


Leu 


Leu 


Ala 


Asp 


Ala 


Arg 


Val 


Cys 
y 




730 










735 




Leu 


Gly 


Gin 


Ala 


Glu 


Ala 


Ala 


Leu 


745 










750 






Ala 


Ser 


Ala 


Ala 


Asn 


Cys 


His 


Gly 










765 








Val 


Ala 


Ala 


Trn 


His 


He 


Arg 


Gly 

y 








780 










Cys 


Leu 


Thr 


Glv 


Leu 


Tro 


Pro 


Phe 






795 










800 


Arg 


Gin 


Ala 


Tvr 

x y x 


Ala 


Tvr 
y 


Asp 


Ala 




810 










815 




Gly 


Leu 


Leu 


Ile 


Leu 


lie 


Thr 


Leu 


825 










830 






Thr 


Leu 


Leu 


Gly 


Gin 


Cys 


Leu 


Tro 










84 5 








Gly 


Glu 


Ala 


Met 


He 


Gin 


Glu 


Tro 








860 










Gly 


Arg 


Asp 


Gly 


He 


Ala 


Tro 

X X. £J 


Ala 






875 










880 


Val 


Phe 


Asp 


He 


Thr 


Lys 


Tro 

X X £J 


Leu 




890 










895 




Leu 


Leu 


Arg 


Ala 


Ala 


Leu 


Thr 


His 


905 










910 






Ala 


Leu 


He 


Arg 


Val 


Cys 


Ala 


Leu 










925 








Tvr 


Val 


Gin 


Val 


Ala 


Leu 


Leu 


Ala 








940 










He 


Tvr 


Asp 


His 


Leu 


Thr 


Pro 


Met 






955 










960 


Arg 


Asp 


Leu 


Ala 


Val 


Ala 


Val 


Glu 




970 










975 




LyS 


Lys 


Val 


lie 


Val 


Tro 


Gly Ala 


985 










990 






Leu 


His 


Gly 


Leu 


Pro 


Val 


Ser 


Ala 








1005 








Gly 


Pro 


Ala 


Asp 


Gly 


Tyr 


Thr 


Ser 






1020 










He 


Thr 


Ala 


Tyr Ala 


Gin 


Gin 


Thr 




1035 








1040 


Val 


Ser 


Met 


Thr 


Gly 


Arg 


Asp 


Arg 


1050 








1055 




He 


Leu 


Ser 


Thr 


Val 


Ser 


Gin 


Ser 



065 1070 
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Phe Leu Gly Thr Thr He Ser Gly Val Leu Trp Thr Val Tyr His Gly 

1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Leu Arg Gly Pro Val Thr Gin Met 

1090 ~ 1095 " 1100 

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 

Thr Lys Ser Leu Glu Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu 

1125 1130 1135 

Val Thr Arg Asn Ala Asp Val He Pro Ala Arg Arg Arg Gly Asp Lys 

1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro He Ser Thr Leu Lys Gly Ser 

1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Val Val Gly Leu Phe 

1170 1175 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser lie Asp Phe He 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp 

1205 1210 1215 

Asn Ser Thr Pro Pro Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu 

1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 

1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 

1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly He Asn Pro 
1265 1270 1275 1280 

Asn He Arg Thr Gly Val Arg Thr Val Met Thr Gly Glu Ala He Thr 

1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly 

1300 1305 1310 

Ala Tyr Asp He He He Cys Asp Glu Cys His Ala Val Asp Ala Thr 

1315 1320 ' 1325 

Ser He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 

1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asp He Glu Glu Val Gly Leu Gly Arg Glu Gly Glu 

1365 1370 1375 

He Pro Phe Tyr Gly Arg Ala He Pro Leu Ser Cys He Lys Gly Gly 

1380 1385 1390 

Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 - " 14Q5 

Ala Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly 

1410 1415 1420 

Leu Asp Val Ser He He Pro Ala Gin Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He 

1445 1450 1455 

Asp Cys Asn Val Ala Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro 

1460 1465 1470 

Thr Phe Thr He Thr Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg 

1475 1480 1485 

Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Gin Gly Thr Tyr Arg 

1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro 



23 



1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 

1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 

1555 1560 1565 

Leu Thr His lie Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly 

1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala 

1605 1610 1615 

Arg Leu Lys Pro Thr Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu 

1620 1625 1630 

Gly Pro lie Thr Asn Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr 

1635 1640 1645 

He Ala Thr Cys Met Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp 

1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Ser He He Gly Arg Leu His Val Asn Gin Arg Val 

1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 

1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu He Glu Glu Gly Gin Arg He 

1715 1720 1725 

Ala Glu Met Leu Lys Ser Lys lie Gin Gly Leu Leu Gin Gin Ala Ser 

1730 1735 1740 

Lys Gin Ala Gin Asp He Gin Pro Ala Met Gin Ala Ser Trp Pro Lys 
1745 1750 1755 1760 

Val Glu Gin Phe Trp Ala Arg His Met Trp Asn Phe He Ser Gly He 

1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 

1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 

1795 1800 • 1805 

Thr Thr He Leu Leu Asn He Met Gly Gly Trp Leu Ala Ser Gin He 

1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu 

1845 1850 1855 

Ala Gly Tyr Gly Ala Gly He Ser Gly Ala Leu Val Ala Phe Lys He 

1860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val He Asn Leu Leu Pro 

1875 1880 1885 

Gly He Leu Ser Pro Gly Ala Leu Val Val Gly Val He Cys Ala Ala 

1890 1895 1900 

He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 

1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu 

1940 1945 1950 

Gly Ser Leu Thr He Thr Ser Leu Leu Arg Arg Leu His Asn Trp He 

1955 1960 1965 

Thr Glu Asp Cys Pro He Pro Cys Ser Gly Ser Trp Leu Arg Asp Val 
1970 1975 1980 
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Trp Asp Trp Val Cys Thr lie Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Leu Pro Gly Leu Pro Phe lie Ser Cys Gin 

2005 2010 2015 

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly lie Met Thr Thr Arg 

2020 2025 2030 

Cys Pro Cys Gly Ala Asn lie Ser Gly Asn Val Arg Leu Gly Ser Met 

2035 2040 2045 

Arg lie Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe 

2050 2055 2060 

Pro lie Asn Cys Tyr Thr Glu Gly Gin Cys Ala Pro Lys Pro Pro Thr 
2065 2070 2075 2080 

Asn Tyr Lys Thr Ala lie Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 

2085 2090 2095 

Val Thr Gin His Gly Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp 

2100 " " 2105 " 2110 

Asn Leu Lys lie Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp 

2115 2120 2125 

Val Asp Gly Val Gin lie His Arg Phe Ala Pro Thr Pro Lys Pro Phe 
2130 2135 2140 



Phe Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg 

2165 2170 2175 

Ser Met Leu Thr Asp Pro Pro His lie Thr Ala Glu Thr Ala Ala Arg 

2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser 

2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn 

2210 2215 2220 

Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly 
2225 2230 2235 2240 

Val Ala Gin Thr Glu Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu 

2245 2250 2255 

Glu Pro Met Ala Glu Glu Glu Ser Asp Leu Glu Pro Ser lie Pro Ser 

2260 2265 2270 

Glu Cys Met Leu Pro Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro 

2290 ^ 2295 2300 

Asp Tyr Gin Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys 
2305 " 2310 ' 2315 2320 

Lys Ala Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 

2325 2330 2335 

Glu Ser Thr He Ser Glu Ala Leu Gin Gin Leu Ala He Lys Thr Phe 

2340 2345 2350 

Gly Gin Pro Pro Ser Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly 

2355 2360 ' 2365 

Ala Ala Glu Ser Gly Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser 

2370 2375 2380 

Glu Thr Gly Ser Ala Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Glu Ser Asp Gin Val Glu Leu Gin Pro Pro Pro Gin 

2405 2410 2415 

Gly Gly Gly Val Ala Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys 
2420 2425 2430 
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Ser Glu Glu Asp Asp Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp 

2435 2440 2445 

Thr Gly Ala Leu lie Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 

2450 2455 2460 

lie Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Ser Lys Ser Ala Ser Gin Arg Ala Lys Lys Val Thr Phe 

2485 2490 2495 

Asp Arg Thr Gin Val Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp 

2500 2505 2510 

lie Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu 

2515 2520 2525 

Glu Ala Cys Gin Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 

2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

lie Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Pro Gin Thr Pro lie 

2565 ~ 2570 2575 

Pro Thr Thr lie Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala 

2580 2585 2590 

Lys Gly Gly Lys Lys Pro Ala Arg Leu lie Val Tyr Pro Asp Leu Gly 

2595 ^ ^ 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp lie Thr Gin Lys Leu 

2610 2615 2620 

Pro Gin Ala Val Met Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala 
2625 2630 2635 2640 

Gin Arg Val Glu Tyr Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro 

2645 ^ 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 

2660 2665 2670 

Arg Asp lie Arg Thr Glu Glu Ser lie Tyr Gin Ala Cys Ser Leu Pro 

2675 2680 2685 

Glu Glu Ala Arg Thr Ala lie His Ser Leu Thr Glu Arg Leu Tyr Val 

2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gin Thr Cys Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr lie Thr 

2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly lie Val Ala 

2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Ser Glu Ser 

2755 2760 2765 

Gin Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 

2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
2785 2790 2795 2800 

Asp Leu Glu Leu He Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 

2805 2810 2815 

Gly Pro Arg Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 

2820 " " 2825 2830 

Pro Leu Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro He Asn 

2835 2840 2845 

Ser Trp Leu Gly Asn He He Gin Tyr Ala Pro Thr He Trp Val Arg 

2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser He Leu Met Val Gin Asp Thr 
2865 2870 2875 2880 

Leu Asp Gin Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val 
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2885 2890 2895 

Asn Pro Leu Asp Leu Pro Ala lie lie Glu Arg Leu His Gly Leu Asp 

2900 2905 2910 

Ala Phe Ser Met His Thr Tyr Ser His His Glu Leu Thr Arg Val Ala 

2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser 

2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu lie Ser Arg Gly Gly Lys Ala 
2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 

2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 

2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp He Phe His Ser Val Ser Arg 

2995 3000 3005 

Ala Arg Pro Arg Ser Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly 

3010 3015 "* 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 



<210> 5 

<211> 9674 

<212> DNA 

<213> Hepatitis C virus 

<220> 
<221> CDS 

<222> (341) . . (9442) 



<400> 5 
acccgcccct 


aataggggcg 


acactccgcc 


atgaatcact 


cccctgtgag 


gaactactgt 


60 


cttcacgcag 


aaagcgtcta 


gccatggcgt 


tagtatgagt 


gtcgtacagc 


ctccaggccc 


120 


ccccctcccg 


ggagagccat 


agtggtctgc 


ggaaccggtg 


agtacaccgg 


aattgccggg 


180 


aagactgggt 


cctttcttgg 


ataaacccac 


tctatgcccg 


gccatttggg 


cgtgcccccg 


240 


caagactgct 


agccgagtag 


cgttgggttg 


cgaaaggcct 


tgtggtactg 


cctgataggg 


300 


tgcttgcgag 


tgccccggga 


ggtctcgtag 


accgtgcacc 


atg age aca aat ccc 
Met Ser Thr Asn Pro 


355 



aaa cct caa aga aaa acc aaa aga aac act aac cgt cgc cca caa gac 403 

Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gin Asp 

10 " 15 20 

gtt aag ttt ccg ggc ggc ggc cag ate gtt ggc gga gta tac ttg ttg 451 

Val Lys Phe Pro Gly Gly Gly Gin He Val Gly Gly Val Tyr Leu Leu 

25 30 35 

ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg aca agg aag get teg 4 99 

Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Ala Ser 

40 45 50 
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gag egg tec cag cca cgt ggg agg cgc cag ccc ate ccc aaa cat egg 547 

Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro lie Pro Lys His Arg 
55 60 " 65 

cgc tec act ggc aag tec tgg ggg aag cca gga tac ccc tgg ccc ctg 595 

Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly Tyr Pro Trp Pro Leu 

70 75 80 85 

tat ggg aat gag ggg etc ggt tgg gca gga tgg etc ctg tec cct cga 64 3 

Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg 

90 95 100 

ggt tec cgt ccc tea tgg ggc ccc aat gac ccc egg cat agg teg cgc 691 

Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro Arg His Arg Ser Arg 
105 110 115 

aat gtg ggt aag gtc ate gat acc eta acg tgc ggc ttt gee gac etc 739 

Asn Val Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
120 125 ' 130 

ttg ggg tac gtc ccc gtc gta ggc gee ccg ctt agt ggc gtt gee agt 787 

Leu Gly Tyr Val Pro Val Val Gly Ala Pro Leu Ser Gly Val Ala Ser 
135 140 145 

get etc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat ttt gca 835 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Phe Ala 

150 155 160 165 

aca ggg aac tta cct ggt tgc tec ttt tct ate ttc ttg ctg gee eta 883 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 

170 175 180 

ctg tec tgc ate act act ccg gtc tct get gtc caa gtg aag aac acc 931 

Leu Ser Cys lie Thr Thr Pro Val Ser Ala Val Gin Val Lys Asn Thr 
185 190 195 

age aac gee tat atg gcg act aac gac tgt tec aat gac age ate act 979 

Ser Asn Ala Tyr Met Ala Thr Asn Asp Cys Ser Asn Asp Ser lie Thr 
200 205 210 

tgg cag ctt gag gec gca gtc etc cat gtc ccc ggg tgc gtc ccg tgc 1027 

Trp Gin Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys 
215 220 225 

gag aaa atg ggg aac aca tea egg tgc tgg ata cca gtc tea cca aac 1075 

Glu Lys Met Gly Asn Thr Ser Arg Cys Trp lie Pro Val Ser Pro Asn 

230 235 240 245 

gtg get gtg egg cag cct ggc gee etc acg egg ggc ttg egg acg cac 1123 

Val Ala Val Arg Gin Pro Gly Ala Leu Thr Arg Gly Leu Arg Thr His 

250 255 260 

ate gac atg gtc gtg ttg tec gec acg etc tgc tec get etc tac gtg 1171 

lie Asp Met Val Val Leu Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val 
265 270 275 

ggg gac etc tgt ggc ggg gtg atg etc gcg tec cag atg ttc att gtc 1219 
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Gly Asp Leu 
280 

teg ccg cag 
Ser Pro Gin 
295 

cct ggc gec 
Pro Gly Ala 
310 

tgg teg ccc 
Trp Ser Pro 



gag gtc ate 
Glu Val He 



ggc ctg gec 
Gly Leu Ala 
360 

ate etc ctg 
He Leu Leu 
375 

age get get 
Ser Ala Ala 
390 

ggc get egg 
Gly Ala Arg 



ate aac cgc 
He Asn Arg 



ttc acg gee 

Phe Thr Ala 
440 

gag cgc ctg 

Glu Arg Leu 
455 

ggc gee ctg 

Gly Ala Leu 
470 

cca tat tgc 

Pro Tyr Cys 



ggg ace gtg 
Gly Thr Val 



Cys Gly Gly 



cac cac tgg 
His His Trp 



ate act ggg 
He Thr Gly 
315 

acg ace ace 
Thr Thr Thr 
330 

ata gac ate 
He Asp He 
345 

tac ttc tct 
Tyr Phe Ser 



ctg gee tct 
Leu Ala Ser 



ggg cgc act 
Gly Arg Thr 
395 

cag aac att 
Gin Asn He 
410 

ace gee ctg 
Thr Ala Leu 
425 

ctg ttc tac 
Leu Phe Tyr 



tec gee tgt 
Ser Ala Cys 



caa tac gac 
Gin Tyr Asp 
475 

tgg cac tac 
Trp His Tyr 
4 90 

tgc ggc cca 
Cys Gly Pro 



Val Met Leu 
285 

ttc gtg cag 
Phe Val Gin 
300 

cac cgt atg 
His Arg Met 



atg ate ctg 
Met He Leu 



att age gga 
He Ser Gly 
350 

atg cag gga 
Met Gin Gly 
365 

ggg gtg gac 
Gly Val Asp 
380 

ace agt age 
Thr Ser Ser 



cag etc att 
Gin Leu He 



aat tgc aac 
Asn Cys Asn 
4 30 

ate cat aag 
He His Lys 
4 4 5 

cgc aac ate 
Arg Asn He 
460 

gac aat gtc 
Asp Asn Val 



cca cca aaa 
Pro Pro Lys 



gtg tac tgt 
Val Tyr Cys 



Ala Ser Gin 



gaa tgc aat 
Glu Cys Asn 
305 

gca tgg gac 
Ala Trp Asp 
320 

gcg tac gtg 
Ala Tyr Val 
335 

get cac tgg 
Ala His Trp 



gcg tgg gcg 
Ala Trp Ala 



gcg tac ace 
Ala Tyr Thr 
385 

ctg gee age 
Leu Ala Ser 
400 

aat acc aat 
Asn Thr Asn 
415 

gat tec ttg 
Asp Ser Leu 



ttc aac teg 
Phe Asn Ser 



gag gac ttc 
Glu Asp Phe 
465 

acc aat cca 
Thr Asn Pro 
480 

cag tgt ggc 
Gin Cys Gly 
495 

ttc acc cct 
Phe Thr Pro 



Met Phe He 
290 

tgc tec ate 
Cys Ser He 



atg atg atg 
Met Met Met 



atg cgc gtt 
Met Arg Val 
340 

ggc gtc atg 
Gly Val Met 
355 

aag gtc gtt 
Lys Val Val 
370 

acc acg act 
Thr Thr Thr 



gee ttc tec 
Ala Phe Ser 



ggt age tgg 
Gly Ser Trp 
420 

cac acc ggc 
His Thr Gly 
435 

teg gga tgt 
Ser Gly Cys 
450 

egg ata gga 
Arg He Gly 



gaa gat atg 
Glu Asp Met 



gta gtc ccc 
Val Val Pro 
500 

age ccg gtg 
Ser Pro Val 



Val 



tac 1267 
Tyr 



aac 1315 

Asn 

325 

ccc 1363 
Pro 



ttt 1411 
Phe 



gtc 1459 
Val 



ggg 1507 
Gly 



cct 1555 

Pro 

405 

cac 1603 
His 



ttc 1651 
Phe 



ccc 1699 
Pro 



tgg 1747 
Trp 



agg 1795 

Arg 

485 

gca 1843 
Ala 



gta 1891 
Val 
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505 



510 



515 



gtg ggc acg acc gat aga ctt gga gtg cct act tac acg tgg gga gag 1939 
Val Gly Thr Thr Asp Arg Leu Gly Val Pro Thr Tyr Thr Trp Gly Glu 
520 525 530 

aat gag aca gat gtc ttc eta ttg aac age acc cga cca ccg teg ggg 1987 
Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Ser Gly 
535 540 545 

tea tgg ttt ggc tgc acg tgg atg aac tec act ggc ttc acc aag acc 2035 
Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr 
550 " 555 560 ' 5 65 

tgc ggc gca cca ccc tgc cgc act aga get gac ttc aat acc age aca 2083 
Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Thr Ser Thr 
570 575 580 

gat ctg ttg tgc ccc acg gac tgt ttt aga aaa cat cct gaa gee act 2131 
Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr 
585 5 90 " 595 

tac ate aaa tgt ggt tec ggg cct tgg etc acg cca aag tgt ctg gtt 2179 
Tyr lie Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val 
600 605 610 

gac tac ccc tac agg etc tgg cat tac cct tgc aca gtc aat tac tec 2227 
Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Tyr Ser 
615 620 625 

acc ttc aag ate agg atg tat gtg ggg gga gtt gag cac agg etc atg 2275 
Thr Phe Lys lie Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Met 
630 635 ^ 640 ~ 645 

gee gcg tgc aat ttc act cgt ggg gat cgc tgc aac ttg gag gat agg 2323 
Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Asn Leu Glu Asp Arg 
650 655 660 

gac aga agt caa cag act cct ctg ttg cac tec acc acg gaa tgg gee 2371 
Asp Arg Ser Gin Gin Thr Pro Leu Leu His Ser Thr Thr Glu Trp Ala 
665 670 675 

att ttg ccc tgc tct ttc tea gac ttg ccc get ttg teg act ggt ctt 2419 
lie Leu Pro Cys Ser Phe Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu 
680 685 690 

etc cac etc cac caa aat ate gtg gac gta caa tat atg tat ggc ctg 24 67 
Leu His Leu His Gin Asn lie Val Asp Val Gin Tyr Met Tyr Gly Leu 
695 700 705 

tea cct gee etc aca caa tat ate gtt cga tgg gag tgg gta gta etc 2515 
Ser Pro Ala Leu Thr Gin Tyr lie Val Arg Trp Glu Trp Val Val Leu 
710 715 720 725 

tta ttc ctg etc eta gcg gac gee agg gtc tgc gee tgc ttg tgg atg 2563 
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met 
730 735 ' 740 
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etc ate ttg 
Leu lie Leu 



ttg cac get 
Leu His Ala 
760 

ate ttt etc 
lie Phe Leu 
775 

get get tat 
Ala Ala Tyr 
7 90 

gca ctg ccc 
Ala Leu Pro 



gtg ggc gcg 
Val Gly Ala 



ggg tat aag 
Gly Tyr Lys 
840 

ctg acc ctg 
Leu Thr Leu 
855 

gcg cgc ggc 
Ala Arg Gly 
870 

ccg ggc gta 
Pro Gly Val 



cct ggt tac 
Pro Gly Tyr 



aga gee cac 
Arg Ala His 
920 

ggg ggt agg 
Gly Gly Arg 
935 

ggc act tac 
Gly Thr Tyr 
950 



ctg ggc caa 
Leu Gly Gin 
745 

gcg age gca 
Ala Ser Ala 



gtg get get 
Val Ala Ala 



tec ctt act 
Ser Leu Thr 
795 

cag cag get 
Gin Gin Ala 
810 

get ttg eta 
Ala Leu Leu 

825 

acc ctt etc 
Thr Leu Leu 



gcg gaa acc 
Ala Glu Thr 



ggc cgt gat 
Gly Arg Asp 
875 

gtg ttt gac 
Val Phe Asp 
8 90 

etc eta aga 
Leu Leu Arg 
905 

get ctg ctg 
Ala Leu Leu 



tac gtc cag 
Tyr Val Gin 



ate tat gac 
lie Tyr Asp 
955 



gee gaa gca 
Ala Glu Ala 
750 

get age tgc 
Ala Ser Cys 
765 

tgg cac ate 
Trp His lie 
780 

ggc ctg tgg 
Gly Leu Trp 



tac gee tat 
Tyr Ala Tyr 



gta ctg att 
Val Leu He 
830 

age cag tec 
Ser Gin Ser 
845 

atg gtc cag 
Met Val Gin 
860 

ggc ate ata 
Gly He He 



ata acc aag 
He Thr Lys 



ggt get ttg 
Gly Ala Leu 
910 

aga atg tgc 
Arg Met Cys 
925 

atg gcg eta 
Met Ala Leu 
940 

cac etc acc 
His Leu Thr 



gca ctg gag 
Ala Leu Glu 



aat ggc ttc 
Asn Gly Phe 



aag ggt agg 
Lys Gly Arg 
785 

ccg ttc tgc 
Pro Phe Cys 
800 

gat gca tct 
Asp Ala Ser 
815 

acc etc ttt 
Thr Leu Phe 



ctg tgg tgg 
Leu Trp Trp 



gag tgg gca 
Glu Trp Ala 
865 

tgg gee gee 
Trp Ala Ala 
880 

tgg etc tta 
Trp Leu Leu 
895 

acg cgc gtg 
Thr Arg Val 



act atg gtg 
Thr Met Val 



tta gee ctt 
Leu Ala Leu 
945 

cct atg teg 
Pro Met Ser 
960 



aag ctg gtc 
Lys Leu Val 
755 

ctg tat ttt 
Leu Tyr Phe 
770 

gtg gtc ccc 
Val Val Pro 



eta ctg etc 
Leu Leu Leu 



gtg cac gga 
Val His Gly 
820 

aca etc acc 
Thr Leu Thr 
835 

ttg tgc tat 
Leu Cys Tyr 
850 

cca tec atg 
Pro Ser Met 



acc ata ttt 
Thr He Phe 



gcg gtg ctt 

Ala Val Leu 
900 

cca tat ttc 

Pro Tyr Phe 
915 

agg cac etc 

Arg His Leu 

.930 

ggc agg tgg 

Gly Arg Trp 



gat tgg get 
Asp Trp Ala 



gtc 2611 
Val 



gtc 2659 
Val 



ttg 2707 
Leu 



eta 2755 

Leu 

805 

cag 2803 
Gin 



ccg 2851 
Pro 



etc 2899 
Leu 



cag 2947 
Gin 



tgc 2995 

Cys 

885 

ggg 3043 
Gly 



gtc 3091 
Val 



gcg 3139 
Ala 



act 3187 
Thr 



get 3235 

Ala 

965 
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age ggc ctg egg gac ttg gcg gtc get gtg gag cct ate ate ttc agt 3283 
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro He He Phe Ser 
970 975 980 

ccg atg gag aag aaa gtc ate gtt tgg gga gcg gag acg get gcg tgc 3331 
Pro Met Glu Lys Lys Val He Val Trp Gly Ala Glu Thr Ala Ala Cys 
985 990 995 

ggg gac ate ttg cac gga ctt ccc gtg tec gee cga etc ggt egg gag 3379 
Gly Asp He Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Arg Glu 
1000 1005 1010 

ate etc ctt ggc cca get gat ggc tac acc tec aag ggg tgg aag ctt 3427 
He Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu 
1015 1020 1025 

etc gec ccc ate acc get tac gec cag cag aca cga ggt etc ttg ggc 3475 
Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
1030 1035 1040 1045 

tct ata gtg gtg age atg acg ggg cgt gac aag aca gaa cag gee ggg 3523 
Ser He Val Val Ser Met Thr Gly Arg Asp Lys Thr Glu Gin Ala Gly 
1050 1055 1060 

gag gtc caa gtc ctg tec aca gtc act cag tec ttc etc gga aca tec 3571 
Glu Val Gin Val Leu Ser Thr Val Thr Gin Ser Phe Leu Gly Thr Ser 
1065 1070 1075 

att teg ggg gtc tta tgg act gtt tac cac gga get ggc aac aag aca 3619 
He Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr 
1080 1085 1090 

eta gec ggc teg egg ggc ccg gtc acg cag atg tac teg age gee gag 3667 
Leu Ala Gly Ser Arg Gly Pro Val Thr Gin Met Tyr Ser Ser Ala Glu 
1095 1100 1105 

ggg gac ttg gtc ggg tgg ccc age cct cct ggg acc aaa tct ttg gag 3715 
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu 
1110 ^ 1115 1120 1125 

ccg tgt acg tgt gga gcg gtc gac ctg tat ttg gtc acg egg aac get 3763 
Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala 
1130 1135 1140 

gat gtc ate ccg get cga aga cgc ggg gac aag egg gga gcg ctg etc 3811 
Asp Val He Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu 
1145 1150 1155 

tec ccg aga ccc ctt teg acc ttg aag ggg tec teg ggg gga cct gtg 3859 
Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val 
1160 1165 1170 

ctt tgc cct agg ggc cac get gtc gga ate ttc egg gca get gtg tgc 3907 
Leu Cys Pro Arg Gly His Ala Val Gly He Phe Arg Ala Ala Val Cys 
1175 1180 1185 

tct egg ggt gtg get aag tec ata gat ttc ate ccc gtt gag acg etc 3955 
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Ser Arg Gly Val Ala Lys Ser lie Asp Phe He Pro Val Glu Thr Leu 
1190 ^ ' 1195 1200 1205 



gac ate gtc acg egg tct ccc acc ttt agt gac aac age aca cca cca 4003 
Asp He Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro 
1210 1215 1220 

get gtg ccc cag acc tat cag gtg ggg tac ttg cac gec ccc act ggc 4051 
Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu His Ala Pro Thr Gly 
1225 1230 1235 



agt gga aaa age acc aag gtc ccc gtc gcg tac gec gec cag ggg tat 4099 
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gin Gly Tyr 
1240 1245 1250 

aaa gtg ctg gtg etc aat ccc teg gtg get gec acc ctg gga ttt ggg 4147 
Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
1255 1260 1265 

gcg tac ttg tec aag gca cat ggc ate aac ccc aac att agg act gga 4195 
Ala Tyr Leu Ser Lys Ala His Gly He Asn Pro Asn He Arg Thr Gly 
1270 " J 1275 1280 1285 

gtc aga act gtg acg acc ggg gag ccc att aca tac tec acg tat ggt 4243 
Val Arg Thr Val Thr Thr Gly Glu Pro He Thr Tyr Ser Thr Tyr Gly 
1290 1295 1300 

aaa ttc etc gee gat ggg ggc tgc gca ggc ggc gec tat gac ate ate 4291 
Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly Ala Tyr Asp He He 
1305 1310 1315 

ata tgc gat gaa tgc cac tct gtg gat get acc act att etc ggc ate 4339 
He Cys Asp Glu Cys His Ser Val Asp Ala Thr Thr He Leu Gly He . 
1320 1325 ^ 1330 

ggg aca gtc ctt gac caa gca gag aca gec ggg gtc agg eta act gta 4387 
Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Val Arg Leu Thr Val 
1335 1340 1345 

ctg gec acg gec acg ccc ccc ggg teg gtg aca acc ccc cat ccc aat 4435 
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asn 
1350 1355 1360 1365 

ata gag gag gta gec etc gga cag gag ggt gag ate ccc ttc tat ggg 4483 
He Glu Glu Val Ala Leu Gly Gin Glu Gly Glu He Pro Phe Tyr Gly 
1370 1375 1380 

agg gcg ttt ccc ctg tct tac ate aag gga ggg agg cac ttg att ttc 4531 
Arg Ala Phe Pro Leu Ser Tyr He Lys Gly Gly Arg His Leu He Phe 
1385 1390 1395 

tgc cac tea aag aaa aag tgt gac gag etc gca acg gee ctt egg ggc 4579 
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Thr Ala Leu Arg Gly 
14 00 14 05 1410 

atg ggc ttg aac get gtg gca tat tac aga ggg ttg gac gtc tec ata 4627 
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Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser lie 
1415 1420 1425 



ata cca act caa gga gat gtg gtg gtc gtt gcc acc gac gcc etc atg 4 675 
He Pro Thr Gin Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met 
1430 1435 1440 1445 

acg ggg tat act gga gac ttt gac tec gtg ate gac tgc aac gta gcg 4723 
Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Val Ala 
1450 1455 1460 

gtc acc cag gcc gta gac ttc age ctg gac ccc acc ttc act ata acc 4771 
Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Thr 
1465 1470 1475 

aca cag act gtc ccg caa gac get gtc tea cgt agt cag cgc cga ggg 4819 
Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
1480 1485 1490 

cgc acg ggt aga gga aga ctg ggc att tat agg tat gtt tec act ggt 4867 
Arg Thr Gly Arg Gly Arg Leu Gly He Tyr Arg Tyr Val Ser Thr Gly 
1495 1500 1505 

gag cga gcc tea gga atg ttt gac agt gta gta etc tgt gag tgc tac 4 915 
Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr 
1510 ' 1515 1520 1525 

gac gca gga get get tgg tat gag etc tea cca gtg gag acg acc gtc 4963 
Asp Ala Gly Ala Ala Trp Tyr Glu Leu Ser Pro Val Glu Thr Thr Val 
1530 1535 1540 

agg etc agg gcg tat ttc aac acg cct ggc ttg cct gtg tgc cag gac 5011 
Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
1545 1550 ' 1555 

cac ctt gag ttt tgg gag gca gtt ttc acc ggc etc aca cac ata gac 5059 
His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His He Asp 
1560 1565 1570 

get cat ttc ctt tec cag aca 
Ala His Phe Leu Ser Gin Thr 
1575 1580 

tta gta gcc tat cag gcc aca 

Leu Val Ala Tyr Gin Ala Thr 
1590 1595 

ccg tec tgg gac gtc atg tgg 
Pro Ser Trp Asp Val Met Trp 
1610 

ctt gtg ggc cct aca cct etc 
Leu Val Gly Pro Thr Pro Leu 
1625 

gag gtc acc ctt aca cac ccc 
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aag cag teg ggg gaa aat ttc gca tac 5107 

Lys Gin Ser Gly Glu Asn Phe Ala Tyr 
1585 

gtg tgc gcc agg gcc aaa gcg ccc ccc 5155 

Val Cys Ala Arg Ala Lys Ala Pro Pro 
1600 1605 

aag tgc ttg act cga etc aag ccc acg 5203 

Lys Cys Leu Thr Arg Leu Lys Pro Thr 
1615 1620 

ctg tac cgt ttg ggc tct gtt acc aac 5251 

Leu Tyr Arg Leu Gly Ser Val Thr Asn 
1630 ■ 1635 

gtg aca aaa tac ate gcc aca tgc atg 5299 



Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr He Ala Thr Cys Met 
1640 1645 1650 



caa get gac etc gag gtc atg ace age acg tgg gtc ctg get ggg gga 5347 
Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly 
1655 1660 1665 

gtc tta gca gec gtc gec gcg tat tgc tta gcg acc ggg tgt gtt tec 5395 
Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser 
1670 1675 1680 1685 

ate att ggc cgt tta cac ate aac cag cga get gtc gtc get ccg gac 5443 
He He Gly Arg Leu His He Asn Gin Arg Ala Val Val Ala Pro Asp 
1690 1695 1700 

aag gag gtc etc tat gag get ttt gat gag atg gag gaa tgt gec tec 54 91 
Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser 
1705 1710 1715 

aga gcg get etc ctt gaa gag ggg cag egg ata gec gag atg ctg aag 5539 
Arg Ala Ala Leu Leu Glu Glu Gly Gin Arg He Ala Glu Met Leu Lys 
1720 1725 1730 

tec aag ate caa ggc tta ttg cag caa gec tct aaa cag gee cag gac 5587 
Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser Lys Gin Ala Gin Asp 
1735 1740 1745 

ata caa ccc get gtg caa get teg tgg ccc aag atg gag caa ttc tgg 5635 
He Gin Pro Ala Val Gin Ala Ser Trp Pro Lys Met Glu Gin Phe Trp 
1750 1755 1760 1765 

gec aaa cat atg tgg aac ttc ata age ggc att cag tac etc gca gga 5683 
Ala Lys His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly 
1770 1775 1780 

ctg tea aca ctg cca ggg aac cct get gtg get tec atg atg gca ttc 5731 
Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe 
1785 1790 1795 

age gee gec etc acc agt ccg ttg tea act age acc acc ate ctt ctt 5779 
Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr He Leu Leu 
1800 1805 1810 

aac att ctg ggg ggc tgg ctg gcg tec caa att gcg cca ccc gcg ggg 5827 
Asn He Leu Gly Gly Trp Leu Ala Ser Gin He Ala Pro Pro Ala Gly 
1815 1820 1825 

gec act ggc ttt gtt gtc agt ggc ctg gtg gga get get gtt ggc age 5875 
Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser 
1830 1835 1840 1845 

ata ggc ttg ggt aaa gtg ctg gtg gac ate ctg gca ggg tat ggt gcg 5923 
He Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
1850 1855 1860 

ggc att teg ggg gec etc gtc gcg ttt aag ate atg tct ggc gag aag 5971 
Gly He Ser Gly Ala Leu Val Ala Phe Lys He Met Ser Gly Glu Lys 
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1865 



1870 



1875 



ccc tec atg gag gat gtc ate aac ttg ctg cct ggg att ctg tct cca 6019 
Pro Ser Met Glu Asp Val lie Asn Leu Leu Pro Gly lie Leu Ser Pro 
1880 1885 1890 



ggt get ctg gtg gtg gga gtc ate tgc gcg gee att ctg cgc cgc cat 6067 
Gly Ala Leu Val Val Gly Val lie Cys Ala Ala lie Leu Arg Arg His 
1895 1900 1905 

gtg gga ccg ggg gaa ggc gcg gtc caa tgg atg aac agg ctt ate gec 6115 
Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu lie Ala 
1910 1915 1920 1925 

ttc get tec aga gga aac cac gtc gec cct act cac tac gtg acg gag 6163 
Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu 
1930 1935 1940 

teg gat gcg teg cag cgt gtc ace 'caa ctg ctt ggc tct etc act ata 6211 
Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu Gly Ser Leu Thr lie 
1945 1950 1955 

act agt eta etc agg aga ctt cac aac tgg ate act gag gat tgc ccc 6259 
Thr Ser Leu Leu Arg Arg Leu His Asn Trp lie Thr Glu Asp Cys Pro 
1960 " 1965 1970 

ate cca tgc gee ggc teg tgg etc cgc gat gtg tgg gac tgg gtc tgt 6307 
lie Pro Cys Ala Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys 
1975 1980 1985 

ace ate eta aca gac ttt aag aac tgg ctg ace tec aag ctg ttc cca 6355 
Thr lie Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro 
1990 1995 2000 2005 

aag atg cct ggc etc ccc ttt ate tct tgc caa aag ggg tac aag ggc 6403 
Lys Met Pro Gly Leu Pro Phe lie Ser Cys Gin Lys Gly Tyr Lys Gly 
2010 2015 2020 

gtg tgg gee ggc act ggc ate atg acc aca cga tgc ccc tgc ggc gee 6451 
Val Trp Ala Gly Thr Gly lie Met Thr Thr Arg Cys Pro Cys Gly Ala 
2025 2030 2035 

aac ate tct ggc aac gtc cgc ttg ggc tct atg aga ate aca gga ccc 6499 
Asn lie Ser Gly Asn Val Arg Leu Gly Ser Met Arg lie Thr Gly Pro 
2040 2045 2050 

aaa acc tgc atg aac acc tgg cag ggg acc ttt cct ate aat tgt tat 6547 
Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe Pro lie Asn Cys Tyr 
2055 2060 2065 

aca gaa ggc cag tgc ttg ccg aaa ccc gcg tta aac ttc aag acc gee 6595 
Thr Glu Gly Gin Cys Leu Pro Lys Pro Ala Leu Asn Phe Lys Thr Ala 
2070 2075 2080 2085 

ate tgg aga gtg gcg gee tea gag tac gcg gaa gtg acg cag cac gga 6643 
He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gin His Gly 



36 



2090 



2095 



2100 



tea tat gee tat ata aca ggg ctg acc act gac aac tta aaa gtc cct 6691 

Ser Tyr Ala Tyr lie Thr Gly Leu Thr Thr Asp Asn Leu Lys Val Pro 

2105 2110 2115 

tgc caa etc ccc tct cca gag ttt ttc tct tgg gtg gac gga gta caa 6739 

Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gin 

2120 2125 2130 

ate cat agg tec gec ccc aca cca aag ccg ttt ttc egg gat gag gtc 6787 

lie His Arg Ser Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val 

2135 2140 2145 

teg ttc age gtt ggg etc aat tea ttt gtc gtc ggg tct cag ctt ccc 6835 

Ser Phe Ser Val Gly Leu Asn Ser Phe Val Val Gly Ser Gin Leu Pro 

2150 2155 2160 2165 

tgt gac cct gag ccc gac act gag gta gtg atg tec atg eta aca gac 6883 

Cys Asp Pro Glu Pro Asp Thr Glu Val Val Met Ser Met Leu Thr Asp 

2170 2175 2180 

cca tec cat ate acg gcg gag get gca gcg egg cgt tta gcg egg ggg 6931 

Pro Ser His lie Thr Ala Glu Ala Ala Ala Arg Arg Leu Ala Arg Gly 

2185 2190 2195 

tea ccc cca tct gag gca age tec tea gcg age cag ctg teg gcg cca 6979 

Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro 

2200 2205 2210 

teg ctg cga gec acc tgc acc acc cac ggt agg acc tat gat gtg gac 7027 

Ser Leu Arg Ala Thr Cys Thr Thr His Gly Arg Thr Tyr Asp Val Asp 

2215 2220 2225 

atg gtg gat gee aac ctg ttc atg ggg ggc ggc gtg att egg ata gag 7075 

Met Val Asp Ala Asn Leu Phe Met Gly Gly Gly Val He Arg He Glu 

2230 2235 2240 2245 

tct gag tec aaa gtg gtc gtt ctg gac tec etc gac tea atg acc gag 7123 

Ser Glu Ser Lys Val Val Val Leu Asp Ser Leu Asp Ser Met Thr Glu 

2250 2255 2260 

gaa gag ggc gac ctt gag cct tea gta cca teg gag tat atg etc ccc 7171 

Glu Glu Gly Asp Leu Glu Pro Ser Val Pro Ser Glu Tyr Met Leu Pro 

2265 2270 2275 

agg aag agg ttc cca ccg gee tta ccg get tgg gcg egg cct gat tac 7219 

Arg Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr 

2280 2285 2290 

aac cca ccg ctt gtg gaa teg tgg aag agg cca gat tac caa cca ccc 7267 

Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro Asp Tyr Gin Pro Pro 

2295 2300 2305 

act gtt gcg ggc tgt get etc ccc ccc ccc aaa aag acc ccg acg cct 7315 

Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Thr Pro Thr Pro 

2310 2315 2320 2325 
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cct cca agg aga cgc egg aca gtg ggt ctg age gag age ace ata gga 7363 
Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr lie Gly 
2330 2335 2340 

gat gee etc caa cag ctg gee ate aag tec ttt ggc cag ccc ccc cca 7411 
Asp Ala Leu Gin Gin Leu Ala lie Lys Ser Phe Gly Gin Pro Pro Pro 
2345 2350 2355 

age ggc gat tea ggc ctt tec acg ggg gcg gac gee gee gac tec ggc 7459 
Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Asp Ala Ala Asp Ser Gly 
2360 2365 2370 

gat egg aca ccc cct gac gag ttg get ctt teg gag aca ggt tct acc 7507 
Asp Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser Glu Thr Gly Ser Thr 
2375 2380 2385 

tec tec atg ccc ccc etc gag ggg gag cct ggg gac cca gac ctg gag 7555 
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu 
2390 2395 2400 2405 

cct gag cag gta gag ctt caa cct cct ccc cag ggg ggg gag gca get 7603 
Pro Glu Gin Val Glu Leu Gin Pro Pro Pro Gin Gly Gly Glu Ala Ala 
2410 2415 2420 

ccc ggc teg gac teg ggg tec tgg tct act tgc tec gag gag gat gac 7651 
Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp 
2425 2430 2435 

tec gtc gtg tgc tgc tec atg tea tat tec tgg acc ggg get eta ata 7699 
Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu lie 
2440 " 2445 ~ 2450 

act cct tgt age ccc gaa gag gaa aag ttg cca att aac tec ttg age 7747 
Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro lie Asn Ser Leu Ser 
2455 2460 ^ 2465 

aac teg ctg ttg cga tac cat aac aag gta tac tgt act aca tea aag 7795 
Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr Thr Ser Lys 
2470 2475 " 2480 2485 

agt gee tea eta agg get aaa aag gta act ttt gat agg atg caa gtg 7843 
Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe Asp Arg Met Gin Val 
2490 2495 2500 

etc gac gee tat tat gat tea gtc tta aag gac ate aag eta gcg gee 7891 
Leu Asp Ala Tyr Tyr Asp Ser Val Leu Lys Asp lie Lys Leu Ala Ala 
2505 2510 2515 

tec aag gtc age gca agg etc etc acc tta gag gag gcg tgc caa ttg 7939 
Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gin Leu 
2520 2525 2530 

acc cca ccc cac tct gca aga tec aag tat ggg ttt ggg get aag gag 7987 
Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu 
2535 2540 2545 
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gtc cgc age ttg tec ggg agg gee gtc aac cac ate aag tec gtg tgg 8035 

Val Arg Ser Leu Ser Gly Arg Ala Val Asn His lie Lys Ser Val Trp 

2550 2555 2560 2565 

aag gac etc ttg gaa gac tea caa aca cca att cct aca ace ate atg 8083 

Lys Asp Leu Leu Glu Asp Ser Gin Thr Pro lie Pro Thr Thr lie Met 

2570 2575 2580 

gee aaa aat gag gtg ttc tgc gtg gac ccc gee aag ggg ggt aaa aaa 8131 

Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lys Lys 
2585 2590 2595 



cca get cgc ctt ate gtt tac cct gac etc ggc gtc agg gtc tgc gag 
Pro Ala Arg Leu lie Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu 
2600 2605 2610 



8179 



aag atg gee ctt tat gat gtc aca caa aag ctt cct cag gcg gtg atg 8227 

Lys Met Ala Leu Tyr Asp Val Thr Gin Lys Leu Pro Gin Ala Val Met 
2615 " 2620 2625 

ggg get tct tat ggc ttc cag tac tec ccc get cag egg gtg gag ttt 8275 

Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala Gin Arg Val Glu Phe 
2630 2635 2640 2645 

etc ttg aag gca tgg gcg gaa aag aga gac cct atg ggt ttt teg tat 8323 

Leu Leu Lys Ala Trp Ala Glu Lys Arg Asp Pro Met Gly Phe Ser Tyr 
2650 2655 2660 

gat acc cga tgc ttt gac tea acc gtc act gag aga gac ate agg act 8371 

Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp He Arg Thr 
2665 2670 2675 

gag gag tec ata tac cag gee tgc tec tta ccc gag gag gee cga act 8419 

Glu Glu Ser He Tyr Gin Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr 
2680 2685 2690 

gee ata cac teg ctg act gag aga etc tat gtg gga ggg ccc atg ttc 84 67 

Ala He His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe 
2695 2700 2705 

aac age aag ggc cag tec tgc ggg tac agg cgt tgc cgc gee age ggg 8515 

Asn Ser Lys Gly Gin Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly 
2710 2715 2720 2725 

gtg ctt acc act agt atg ggg aac acc ate aca tgc tat gta aaa gee 8563 

Val Leu Thr Thr Ser Met Gly Asn Thr He Thr Cys Tyr Val Lys Ala 
2730 2735 2740 

eta gcg get tgc aag get gcg ggg ata att gcg ccc acg atg ctg gta 8611 

Leu Ala Ala Cys Lys Ala Ala Gly He He Ala Pro Thr Met Leu Val 
2745 2750 2755 

tgc ggc gac gac ttg gtc gtc ate tea gaa age cag ggg act gag gag. 8659 

Cys Gly Asp Asp Leu Val Val He Ser Glu Ser Gin Gly Thr Glu Glu 
2760 2765 2770 

gac gag egg aac ctg aga gee ttc acg gag get atg acc agg tat tct 8707 
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Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser 
2775 2780 2785 



gcc cct cct ggt gac ccc ccc aga ccg gaa tat gac ctg gag eta ata 8755 
Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu lie 
2790 2795 2800 2805 

aca tct tgt tec tea aac gtg tct gtg gca ctt ggc cca cag ggc cgc 8803 
Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Gin Gly Arg 
2810 2815 2820 

cgc aga tac tac ctg acc aga gac ccc acc act tea att gcc egg get 8851 
Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Ser lie Ala Arg Ala 
2825 2830 2835 

gcc tgg gaa aca gtt aga cac tec cct gtc aat tea tgg ctg gga aac 8899 
Ala Trp Glu Thr Val Arg His Ser Pro Val Asn Ser Trp Leu Gly Asn 
2840 2845 2850 

ate ate cag tac get cca acc ata tgg gtt cgc atg gtc ctg atg aca 8947 
He He Gin Tyr Ala Pro Thr He Trp Val Arg Met Val Leu Met Thr 
2855 2860 2865 

cac ttc ttc tec att etc atg gcc cag gac acc eta gac cag aac ctt 8995 
His Phe Phe Ser He Leu Met Ala Gin Asp Thr Leu Asp Gin Asn Leu 
2870 2875 2880 2885 

aac ttt gaa atg tac gga teg gtg tac tec gtg agt cct ctg gac etc 9043 
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Ser Pro Leu Asp Leu 
2890 2895 2900 

cca gcc ata att gaa agg tta cac ggg ctt gac gcc ttc tct ctg cac 9091 
Pro Ala He He Glu Arg Leu His Gly Leu Asp Ala Phe Ser Leu His 
2905 ' 2910 2915" 

aca tac act ccc cac gaa ctg acg egg gtg get tea gcc etc aga aaa 9139 
Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys 
2920 2925 2930 

ctt ggg gcg cca ccc etc aga gcg tgg aag agt egg gcg cgt gca gtt 9187 
Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser Arg Ala Arg Ala Val 
2935 2940 ~ 2945 

a 99 tec etc ate tec cgt ggg ggg agg gcg gcc gtt tgc ggt egg 9235 

Arg Ala Ser Leu He Ser Arg Gly Gly Arg Ala Ala Val Cys Gly Arg 
2950 2955 2960 2965 

tac etc ttc aac tgg gcg gtg aag acc aag etc aaa etc act cct ttg 9283 
Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu 
2970 2975 2980 

ccg gag gca cgc etc ctg gat ttg tec agt tgg ttt acc gtc ggc gcc 9331 
Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala 
2985 2990 2995 

ggc ggg ggc gac att tat cac age gtg teg cgt gcc cga ccc cgc eta 9379 
Gly Gly Gly Asp He Tyr His Ser Val Ser Arg Ala Arg Pro Arg Leu 
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3000 



3005 



3010 



tta etc ctt age eta etc eta ctt tct gta ggg gta ggc etc ttc eta 9427 
Leu Leu Leu Ser Leu Leu Leu Leu Ser Val Gly Val Gly Leu Phe Leu 
3015 3020 3025 

etc ccc get cga tag agcggcacac attagctaca ctccatagct aactgttcct 9482 

Leu Pro Ala Arg 

3030 

tttttttttt tttttttttt tttttttttt tttttttctt tttttttttt tttccctctt 9542 

tcttcccttc tcatcttatt ctactttctt tcttggtggc tccatcttag ccctggtcac 9602 

ggctagctgt gaaaggtccg tgagcegcat gaetgeagag agtgccgtaa ctggtctctc 9662 

tgcagatcat gt 9674 



<210> 6 
<211> 3033 
<212> PRT 
<213> Hepatitis 

<400> 6 

Met Ser Thr Asn 
1 

Arg Arg Pro Gin 
20 

Gly Val Tyr Leu 
35 

Thr Arg Lys Ala 
50 

lie Pro Lys His 
65 

Tyr Pro Trp Pro 

Leu Leu Ser Pro 
100 

Arg His Arg Ser 
115 

Gly Phe Ala Asp 
130 

Ser Gly Val Ala 
145 

Gly Val Asn Phe 

Phe Leu Leu Ala 
180 

Gin Val Lys Asn 
195 

Asn Asp Ser lie 
210 

Gly Cys Val Pro 
225 

Pro Val Ser Pro 



C virus 



Pro Lys Pro Gin 
5 

Asp Val Lys Phe 

Leu Pro Arg Arg 
40 

Ser Glu Arg Ser 
55 

Arg Arg Ser Thr 
70 

Leu Tyr Gly Asn 
85 

Arg Gly Ser Arg 

Arg Asn Val Gly 
120 

Leu Leu Gly Tyr 
135 

Ser Ala Leu Ala 
150 

Ala Thr Gly Asn 
165 

Leu Leu Ser Cys 

Thr Ser Asn Ala 
200 

Thr Trp Gin Leu 
215 

Cys Glu Lys Met 
230 

Asn Val Ala Val 
245 



Arg 


T.\7Q 


Thr 


T,\;c 
J 




10 






Pro 


Gly 


Gly 


Gly 


25 








Gly 


Pro 


Arg 


Leu 


Gin 


Pro 


Arg 


Gly 








60 


Gly 


Lys 


Ser 


Trp 






75 




Glu 


Gly 


Leu 


Gly 




90 






Pro 


Ser 


Trp 


Gly 


105 








Lys 


Val 


He 


Asp 


Val 


Pro 


Val 


Val 








140 


His 


Gly 


Val 


Arg 






155 




Leu 


Pro 


Gly 


Cys 




170 






He 


Thr 


Thr 


Pro 


185 








Tyr 


Met 


Ala 


Thr 


Glu 


Ala 


Ala 


Val 








220 


Gly 


Asn 


Thr 


Ser 






235 




Arg 


Gin 


Pro 


Gly 




250 







Arg Asn Thr Asn 
15 

Gin He Val Gly 
30 

Gly Val Arg Ala 
45 

Arg Arg Gin Pro 

Gly Lys Pro Gly 
80 

Trp Ala Gly Trp 
95 

Pro Asn Asp Pro 
110 

Thr Leu Thr Cys 
125 

Gly Ala Pro Leu 

Val Leu Glu Asp 
160 

Ser Phe Ser He 
175 

Val Ser Ala Val 
190 

Asn Asp Cys Ser 
205 

Leu His Val Pro 

Arg Cys Trp He 
240 

Ala Leu Thr Arg 
255 
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Gly Leu 


Arg 


Thr 


His 


He 


Asp 


Met 


Val 


Val 


Leu 


Ser 


Ala 


Thr 


Leu 


Cys 








260 










265 










270 






Ser 


Ala 


Leu 


Tyr 


Val 


Gly 


Asp 


Leu 


Cys 


Gly 


Gly 


Val 


Met 


Leu 


Ala 


Ser 






275 










280 










285 








Gin 


Met 


Phe 


He 


Val 


Ser 


Pro 


Gin 


His 


His 


Trp 


Phe 


Val 


Gin 


Glu 


Cys 




290 










295 










300 










Asn 


Cys 


Ser 


He 


Tyr 


Pro 


Gly 


Ala 


He 


Thr 


Gly 


His 


Arg 


Met 


Ala 


Trp 


305 










310 










315 










320 


Asp Met 


Met 


Met 


Asn 


Trp 


Ser 


Pro 


Thr 


Thr 


Thr 


Met 


He 


Leu 


Ala 


Tyr 










325 










330 










335 




Val 


Met 


Arg 


Val 


Pro 


Glu 


Val 


He 


He 


Asp 


He 


He 


Ser 


Gly 


Ala 


His 








340 










345 










350 






Trp Gly 


Val 


Met 


Phe 


Gly 


Leu 


Ala 


Tyr 


Phe 


Ser 


Met 


Gin 


Gly 


Ala 


Trp 






355 










360 










365 








Ala 


Lys 


Val 


Val 


Val 


He 


Leu 


Leu 


Leu 


Ala 


Ser 


Gly 


Val 


Asp 


Ala 


Tyr 




370 










375 










380 










Thr. 


Thr 


Thr 


Thr 


Gly 


Ser 


Ala 


Ala 


Gly 


Arg 


Thr 


Thr 


Ser 


Ser 


Leu 


Ala 


385 










390 










395 










400 


Ser 


Ala 


Phe 


Ser 


Pro 


Gly 


Ala 


Arg 


Gin 


Asn 


He 


Gin 


Leu 


He 


Asn 


Thr 










405 










410 










415 




Asn Gly 


Ser 


Trp 


His 


He 


Asn 


Arg 


Thr 


Ala 


Leu 


Asn 


Cys 


Asn 


Asp 


Ser 








420 










425 










430 






Leu 


His 


Thr 


Gly 


Phe 


Phe 


Thr 


Ala 


Leu 


Phe 


Tyr 


He 


His 


Lys 


Phe 


Asn 






435 










440 










445 








Ser 


Ser 


Gly 


Cys 


Pro 


Glu 


Arg 


Leu 


Ser 


Ala 


Cys 


Arg 


Asn 


He 


Glu 


Asp 




450 










455 










460 










Phe 


Arg 


He 


Gly 


Trp 


Gly 


Ala 


Leu 


Gin 


Tyr 


Asp 


Asp 


Asn 


Val 


Thr 


Asn 


465 










470 










475 










480 


Pro 


Glu 


Asp 


Met 


Arg 


Pro 


Tyr 


Cys 


Trp 


His 


Tyr 


Pro 


Pro 


Lys 


Gin 


Cys 










485 










490 










495 




Gly 


Val 


Val 


Pro 


Ala 


Gly 


Thr 


Val 


Cys 


Gly 


Pro 


Val 


Tyr 


Cys 


Phe 


Thr 








500 










505 










510 






Pro 


Ser 


Pro 


Val 


Val 


Val 


Gly 


Thr 


Thr 


Asp 


Arg 


Leu 


Gly 


Val 


Pro 


Thr 






515 










520 










525 








Tyr 


Thr 


Trp 


Gly 


Glu 


Asn 


Glu 


Thr 


Asp 


Val 


Phe 


Leu 


Leu 


Asn 


Ser 


Thr 




530 










535 










540 










Arg 


Pro 


Pro 


Ser 


Gly 


Ser 


Trp 


Phe 


Gly 


Cys 


Thr 


Trp 


Met 


Asn 


Ser 


Thr 


545 










550 










555 










560 


Gly 


Phe 


Thr 


Lys 


Thr 


Cys 


Gly 


Ala 


Pro 


Pro 


Cys 


Arg 


Thr 


Arg 


Ala 


Asp 










565 










570 










575 




Phe 


Asn 


Thr 


Ser 


Thr 


Asp 


Leu 


Leu 


Cys 


Pro 


Thr 


Asp 


Cys 


Phe 


Arg 


Lys 








580 










585 










590 






His 


Pro 


Glu 


Ala 


Thr 


Tyr 


He 


Lys 


Cys 


Gly 


Ser 


Gly 


Pro 


Trp 


Leu 


Thr 






595 










600 










605 








Pro 


Lys 


Cys 


Leu 


Val 


Asp 


Tyr 


Pro 


Tyr 


Arg 


Leu 


Trp 


His 


Tyr 


Pro 


Cys 




610 










615 










620 










Thr 


Val 


Asn 


Tyr 


Ser 


Thr 


Phe 


Lys 


He 


Arg 


Met 


Tyr 


Val 


Gly 


Gly 


Val 


625 










630 










635 










640 


Glu 


His 


Arg 


Leu 


Met 


Ala 


Ala 


Cys 


Asn 


Phe 


Thr 


Arg 


Gly 


Asp 


Arg 


Cys 










645 










650 










655 




Asn 


Leu 


Glu 


Asp 


Arg 


Asp 


Arg 


Ser 


Gin 


Gin 


Thr 


Pro 


Leu 


Leu 


His 


Ser 








660 










665 










670 






Thr 


Thr 


Glu 


Trp 


Ala 


He 


Leu 


Pro 


Cys 


Ser 


Phe 


Ser 


Asp 


Leu 


Pro 


Ala 






675 










680 










685 








Leu 


Ser 


Thr 


Gly 


Leu 


Leu 


His 


Leu 


His 


Gin 


Asn 


He 


Val 


Asp 


Val 


Gin 




690 










695 










700 










Tyr 


Met 


Tyr 


Gly 


Leu 


Ser 


Pro 


Ala 


Leu 


Thr 


Gin 


Tyr 


He 


Val 


Arg 


Trp 
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705 710 



Glu 


Tro 


Val 


Val 


Leu 


Leu 


Phe 


Leu 










725 








Ala 


Cys 


Leu 


Tro 


Met 


Leu 


He 


Leu 








740 










Glu 


Lys 


Leu 


Val 


Val 


Leu 


His 


Ala 






755 










760 


Phe 


Leu 


Tvr 


Phe 


Val 


He 


Phe 


Leu 




770 










775 




Arg 


Val 


Val 


Pro 


Leu 


Ala 


Ala 


Tvr 
y 


785 










790 






Cys 


Leu 


Leu 


Leu 


Leu 


Ala 


Leu 


Pro 










805 








Ser 


Val 


His 


Gly 


Gin 


Val 


Glv 


Ala 








820 










Phe 


Thr 


Leu 


Thr 


Pro 


Glv 


Tvr 
y 


Lys 






835 










840 


Trp 


Leu 


Cys 


Tyr 


Leu 


Leu 


Thr 


Leu 




850 










855 




Ala 


Pro 


Ser 


Met 


Gin 


Ala 


Arg 


Glv 


865 










870 






Ala 


Thr 


He 


Phe 


Cvs 


Pro 


Gly 


Val 










885 








Leu 


Ala 


Val 


Leu 


Gly 


Pro 


Gly 


Tyr 








900 










Val 


Pro 


Tvr 


Phe 


Val 


Arg 


Ala 


His 






915 










920 


Val 


Arg 


His 


Leu 


Ala 


Glv 
y 


Glv 


Arg 




930 










935 




Leu 


Gly 


Arg 


Tro 


Thr 


Gly 


Thr 


Tyr 


945 










950 






Ser 


Asp 


Tro 


Ala 


Ala 


Ser 


Gly 


Leu 










965 








Pro 


He 


He 


Phe 


Ser 


Pro 


Met 


Glu 








980 










Glu 


Thr 


Ala 


Ala 


Cys 


Gly 


Asp 


He 






995 








1000 


Arg 


Leu 


Gly 


Arg 


Glu 


He 


Leu 


Leu 


1010 








1015 




Lys 


Gly 


Trp 


Lys 


Leu 


Leu 


Ala 


Pro 


1025 






1030 






Arg 


Gly 


Leu 


Leu 


Gly 


Ser 


He 


Val 








1045 








Thr 


Glu 


Gin 


Ala 


Gly 


Glu 


Val 


Gin 






1060 










Phe 


Leu 


Gly 


Thr 


Ser 


He 


Ser 


Gly 




1075 








1080 


Ala 


Gly Asn 


Lys 


Thr 


Leu 


Ala 


Gly 


1090 








1095 




Tyr 


Ser 


Ser 


Ala 


Glu 


Gly Asp 


Leu 


1105 






1110 






Thr 


Lys 


Ser 


Leu 


Glu 


Pro 


Cys 


Thr 








1125 








Val 


Thr Arg Asn Ala 


Asp 


Val 


He 






1140 










Arg 


Gly Ala 


Leu 


Leu 


Ser 


Pro 


Arg 







715 










720 


Leu 


Leu 


Ala 


Asp 


Ala 


Arg 


Val 


Cys 




730 










735 




Leu 


Gly 


Gin 


Ala 


Glu 


Ala 


Ala 


Leu 


745 










750 






Ala 


Ser 


Ala 


Ala 


Ser 


Cys 


Asn 


Gly 










765 








Val 


Ala 


Ala 


Tro 

XT 


His 


He 


Lvs 


Gly 
y 








780 










Ser 


Leu 


Thr 


Glv 


Leu 


Trp 


Pro 


Phe 






795 










800 


Gin 


Gin 


Ala 


Tyr 


Ala 


Tyr 


Asp 


Ala 




810 










815 




Ala 


Leu 


Leu 


Val 


Leu 


He 


Thr 


Leu 


825 










830 






Thr 


Leu 


Leu 


Ser 


Gin 


Ser 


Leu 


Trp 










845 








Ala 


Glu 


Thr 


Met 


Val 


Gin 


Glu 


Trp 








860 










Glv 


Arg 


Asp 


Glv 


He 


He 


Trp 


Ala 






875 










880 


Val 


Phe 


Asp 

IT 


He 


Thr 


Lys 


Trp 


Leu 




890 










895 




Leu 


Leu 


Arg 


Gly 


Ala 


Leu 


Thr 


Arg 


905 










910 






Ala 


Leu 


Leu 


Arg 


Met 


Cys 


Thr 


Met 










925 








Tyr 


Val 


Gin 


Met 


Ala 


Leu 


Leu 


Ala 








940 










He 


Tvr 
y 


Asp 


His 


Leu 


Thr 


Pro 


Met 






955 










960 


Arg 


Asp 


Leu 


Ala 


Val 


Ala 


Val 


Glu 




970 










975 




Lys 


Lys 


Val 


He 


Val 


Trp 


Gly Ala 


985 










990 






Leu 


His 


Gly 


Leu 


Pro 


Val 


Ser 


Ala 








1005 








Gly 


Pro 


Ala 


Asp 


Gly 


Tyr 


Thr 


Ser 






1020 










He 


Thr 


Ala 


Tyr 


Ala 


Gin 


Gin 


Thr 




1035 








1040 


Val 


Ser 


Met 


Thr 


Gly 


Arg 


Asp 


Lys 


1050 








1055 




Val 


Leu 


Ser 


Thr 


Val 


Thr 


Gin 


Ser 


.065 








1070 






Val 


Leu 


Trp 


Thr 


Val 


Tyr 


His 


Gly 








1085 








Ser 


Arg 


Gly 


Pro 


Val 


Thr 


Gin 


Met 






1100 










Val 


Gly Trp 


Pro 


Ser 


Pro 


Pro 


Gly 




1115 








1120 


Cys 


Gly Ala 


Val 


Asp 


Leu 


Tyr 


Leu 


1130 








1135 




Pro Ala Arg Arg Arg 


Gly Asp 


Lys 


.145 








1150 






Pro 


Leu 


Ser 


Thr 


Leu 


Lys 


Gly 


Ser 
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1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly lie Phe 

1170 1175 ~ 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser lie Asp Phe lie 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp lie Val Thr Arg Ser Pro Thr Phe Ser Asp 

1205 1210 1215 

Asn Ser Thr Pro Pro Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu 

1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 

1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 

1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly lie Asn Pro 
1265 1270 1275 1280 

Asn lie Arg Thr Gly Val Arg Thr Val Thr Thr Gly Glu Pro lie Thr 

1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly 

1300 1305 1310 

Ala Tyr Asp lie lie lie Cys Asp Glu Cys His Ser Val Asp Ala Thr 

1315 1320 1325 

Thr He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 

1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asn He Glu Glu Val Ala Leu Gly Gin Glu Gly Glu 

1365 1370 1375 

He Pro Phe Tyr Gly Arg Ala Phe Pro Leu Ser Tyr He Lys Gly Gly 

1380 1385 " 1390 

Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 

1395 1400 " " " 1405 

Thr Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly 

1410 1415 1420 

Leu Asp Val Ser He He Pro Thr Gin Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He 

1445 1450 1455 

Asp Cys Asn Val Ala Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro 

1460 1465 ^ 1470 

Thr Phe Thr He Thr Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg 

1475 1480 1485 

Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu Gly He Tyr Arg 

1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Ser Pro 

1525 1530 1535 

Val Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 

1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 

1555 1560 1565 

Leu Thr His He Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly 

1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr 
1605 1610 1615 
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Arg Leu Lys Pro Thr Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu 

1620 1625 1630 

Gly Ser Val Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr 

1635 1640 1645 

He Ala Thr Cys Met Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp 

1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Ser He He Gly Arg Leu His He Asn Gin Arg Ala 

1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 

1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu Leu Glu Glu Gly Gin Arg He 

1715 1720 1725 

Ala Glu Met Leu Lys Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser 

1730 1735 1740 

Lys Gin Ala Gin Asp He Gin Pro Ala Val Gin Ala Ser Trp Pro Lys 
1745 1750 1755 1760 

Met Glu Gin Phe Trp Ala Lys His Met Trp Asn Phe He Ser Gly He 

1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 

1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 

1795 1800 1805 

Thr Thr He Leu Leu Asn He Leu Gly Gly Trp Leu Ala Ser Gin He 

1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu 

1845 1850 1855 

Ala Gly Tyr Gly Ala Gly He Ser Gly Ala Leu Val Ala Phe Lys He 

1860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val He Asn Leu Leu Pro 

1875 - , 1880 g.885 

Gly He Leu Ser Pro Gly Ala Leu Val Val Gly Val He Cys Ala Ala 

1890 1895 1900 

He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 

1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu 

1940 1945 1950 

Gly Ser Leu Thr He Thr Ser Leu Leu Arg Arg Leu His Asn Trp He 

1955 1960 1965 

Thr Glu Asp Cys Pro He Pro Cys Ala Gly Ser Trp Leu Arg Asp Val 

1970 1975 1980 

Trp Asp Trp Val Cys Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Met Pro Gly Leu Pro Phe He Ser Cys Gin 

2005 2010 2015 

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly He Met Thr Thr Arg 

2020 ^ 2025 2030 

Cys Pro Cys Gly Ala Asn He Ser Gly Asn Val Arg Leu Gly Ser Met 

2035 2040 2045 

Arg He Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe 

2050 2055 2060 

Pro He Asn Cys Tyr Thr Glu Gly Gin Cys Leu Pro Lys Pro Ala Leu 
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2065 2070 2075 2080 

Asn Phe Lys Thr Ala lie Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 

2085 2090 2095 

Val Thr Gin His Gly Ser Tyr Ala Tyr lie Thr Gly Leu Thr Thr Asp 

2100 ^ " 2105 2110 

Asn Leu Lys Val Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp 

2115 2120 2125 

Val Asp Gly Val Gin lie His Arg Ser Ala Pro Thr Pro Lys Pro Phe 

2130 ~ 2135 2140 

Phe Arg Asp Glu Val Ser Phe Ser Val Gly Leu Asn Ser Phe Val Val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Asp Pro Glu Pro Asp Thr Glu Val Val Met 

2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His lie Thr Ala Glu Ala Ala Ala Arg 

2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser 

2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Gly Arg 

2210 2215 2220 

Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Phe Met Gly Gly Gly 
2225 2230 2235 2240 

Val lie Arg lie Glu Ser Glu Ser Lys Val Val Val Leu Asp Ser Leu 

2245 2250 2255 

Asp Ser Met Thr Glu Glu Glu Gly Asp Leu Glu Pro Ser Val Pro Ser 

2260 2265 2270 

Glu Tyr Met Leu Pro Arg Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro 

2290 ^ 2295 2300 

Asp Tyr Gin Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys 
2305 2310 2315 2320 

Lys Thr Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 

2325 2330 2335 

Glu Ser Thr lie Gly Asp Ala Leu Gin Gin Leu Ala lie Lys Ser Phe 

2340 2345 2350 

Gly Gin Pro Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Asp 

2355 2360 2365 

Ala Ala Asp Ser Gly Asp Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser 

2370 2375 2380 

Glu Thr Gly Ser Thr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395- 2400 

Asp Pro Asp Leu Glu Pro Glu Gin Val Glu Leu Gin Pro Pro Pro Gin 

2405 2410 2415 

Gly Gly Glu Ala Ala Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys 

2420 2425 2430 

Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp 

2435 2440 2445 

Thr Gly Ala Leu lie Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 

2450 2455 2460 

lie Asn Ser Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Ser Lys Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe 

2485 2490 2495 

Asp Arg Met Gin Val Leu Asp Ala Tyr Tyr Asp Ser Val Leu Lys Asp 

2500 2505 " 2510 

lie Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu 
2515 2520 2525 
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Glu Ala Cys Gin Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 

2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 

2545 2550 2555 2560 

lie Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Gin Thr Pro lie 

2565 2570 2575 

Pro Thr Thr lie Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala 

2580 2585 2590 

Lys Gly Gly Lys Lys Pro Ala Arg Leu lie Val Tyr Pro Asp Leu Gly 

2595 ~ 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Thr Gin Lys Leu 

2610 ~ 2615 2620 

Pro Gin Ala Val Met Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala 

2625 2630 2635 2640 

Gin Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Arg Asp Pro 

2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 

2660 2665 2670 

Arg Asp lie Arg Thr Glu Glu Ser lie Tyr Gin Ala Cys Ser Leu Pro 

2675 2680 2685 

Glu Glu Ala Arg Thr Ala lie His Ser Leu Thr Glu Arg Leu Tyr Val 

2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gin Ser Cys Gly Tyr Arg Arg 

2705 " 2710 ~ 2715 ' ~ ^ 2720 
Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr lie Thr 

2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly lie lie Ala 

2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val lie Ser Glu Ser 

2755 2760 2765 

Gin Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 

2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 

2785 • 2790 ^2795 2800 

Asp Leu Glu Leu lie Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 

2805 2810 2815 

Gly Pro Gin Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 

2820 2825 2830 

Ser lie Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Asn 

2835 2840 2845 

Ser Trp Leu Gly Asn lie lie Gin Tyr Ala Pro Thr lie Trp Val Arg 

2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser lie Leu Met Ala Gin Asp Thr 

2865 2870 2875 2880 

Leu Asp Gin Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val 

2885 2890 2895 

Ser Pro Leu Asp Leu Pro Ala lie lie Glu Arg Leu His Gly Leu Asp 

2900 2905 2910 

Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala 

2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser 

2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu lie Ser Arg Gly Gly Arg Ala 

2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 

2965 " 2970 ^ 2975 
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Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 

2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp He Tyr His Ser Val Ser Arg 

2995 3000 3005 

Ala Arg Pro Arg Leu Leu Leu Leu Ser Leu Leu Leu Leu Ser Val Gly 

3010 ' 3015 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 



<210> 7 

<211> 8024 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 



replicon 



<400> 7 

accugccccu 

cuucacgcag 

cccccucccg 

aagacugggu 

caagacugcu 

cgcuugcgag 

ucaaagaaaa 

cgcagguucu 

aaucggcugc 

ugucaagacc 

guggcuggcc 

aagggacugg 

uccugccgag 

ggcuaccugc 

ggaagccggu 

cgaacuguuc 

uggcgaugcc 

cuguggccgg 

ugcugaagag 

ucccgauucg 

ccucucccuc 

cguuugucua 

aaccuggccc 

ugcaaggucu 

caacgucugu 

gcggccaaaa 

uugugaguug 

ggcugaagga 

caugcuuuac 

acgugguuuu 

caaacacgag 

caggccgggg 

ucggggguuu 

gguccgguca 

ccugggacca 

cggaacgcug 

ccgagaccca 



aauaggggcg 
aaagcgccua 
ggagagccau 
ccuuucuugg 
agccgaguag 
ugccccggga 
accaaaagaa 
ccggccgcuu 
ucugaugccg 
gaccuguccg 
acgacgggcg 
cugcuauugg 
aaaguaucca 
ccauucgacc 
cuugucgauc 
gccaggcuca 
ugcuugccga 
cugggugugg 
cuuggcggcg 
cagcgcaucg 
cccccccccu 
uauguuauuu 
ugucuucuug 
guugaauguc 
agcgacccuu 
gccacgugua 
gauaguugug 
ugcccagaag 
auguguuuag 
ccuuugaaaa 
gccuccuggg 
aaguccaaau 
uguggacugu 
cgcagaugua 
agucuuugga 
augucauccc 
uuucgaccuu 



acacuccgcc 
gccauggcgu 
aguggucugc 
auaaacccac 
cguuggguug 
ggucucguag 
acaccaaccg 
ggguggagag 
ccguguuccg 
gugcccugaa 
uuccuugcgc 
gcgaagugcc 
ucauggcuga 
accaagcgaa 
aggaugaucu 
aggcgcgcau 
auaucauggu 
cggaccgcua 
aaugggcuga 
ccuucuaucg 
aacguuacug 
uccaccauau 
acgagcauuc 
gugaaggaag 
ugcaggcagc 
uaagauacac 
gaaagaguca 
guaccccauu 
ucgagguuaa 
acacgaugau 
cgccauagug 
ccuguccaca 
uuaccacgga 
cucgagugcu 
gccgugcaag 
ggcucggaga 
gaaggggucc 



augaaucacu 
uaguaugagu 
ggaaccggug 
ucuaugcccg 
cgaaaggccu 
accgugcacc 
ucgcccaaug 
gcuauucggc 
gcugucagcg 
ugaacugcag 
agcugugcuc 
ggggcaggau 
ugcaaugcgg 
acaucgcauc 
ggacgaagag 
gcccgacggc 
ggaaaauggc 
ucaggacaua 
ccgcuuccuc 
ccuucuugac 
gccgaagccg 
ugccgucuuu 
cuaggggucu 
caguuccucu 
ggaacccccc 
cugcaaaggc 
aauggcucuc 
guaugggauc 
aaaaacgucu 
accauggcuc 
gugaguauga 
gucucucagu 
gcuggcaaca 
gagggggacu 
uguggagccg 
cgcggggaca 
ucgggggggc 



ccccugugag 
gucguacagc 
aguacaccgg 
gccauuuggg 
ugugguacug 
augagcacaa 
auugaacaag 
uaugacuggg 
caggggcgcc 
gacgaggcag 
gacguuguca 
cuccugucau 
cggcugcaua 
gagcgagcac 
caucaggggc 
gaggaucucg 
cgcuuuucug 
gcguuggcua 
gugcuuuacg 
gaguucuucu 
cuuggaauaa 
uggcaaugug 
uuccccucuc 
ggaagcuucu 
accuggcgac 
ggcacaaccc 
cucaagcgua 
ugaucugggg 
aggccccccg 
ccaucacugc 
cggggcguga 
ccuuccucgg 
agacucuagc 
ugguaggcug 
ucgaccuaua 
agcggggagc 
cggugcucug 



gaacuacugu 
cuccaggccc 
aauugccggg 
cgugcccccg 
ccugauaggg 
auccuaaacc 
auggauugca 
cacaacagac 
cgguucuuuu 
cgcggcuauc 
cugaagcggg 
cucaccuugc 
cgcuugaucc 
guacucggau 
ucgcgccagc 
ucgugaccca 
gauucaucga 
cccgugauau 
guaucgccgc 
gaguuuaaac 
ggccggugug 
agggcccgga 
gccaaaggaa 
ugaagacaaa 
aggugccucu 
cagugccacg 
uucaacaagg 
ccucggugca 
aaccacgggg 
uuaugcccag 
caggacagaa 
aacaaccauc 
cggcuuacgg 
gcccagcccc 
ucuggucacg 
auugcucucc 
cccuaggggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 
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cacgucguug 
uucauccccg 
acgccaccgg 
ggaaagagca 
aaccccucgg 
aaucccaaca 
acauauggca 
ugcgaugaau 
caagcagaga 
gugacaaccc 
uucuauggga 
cacucaaaga 
guggcauacu 
gucgccaccg 
aauguagcgg 
cagacugucc 
agacagggca 
guagugcuuu 
accaccguca 
cuugaauuuu 
caaacaaagc 
gccagagcca 
aagccuacgc 
gucacccuca 
gucaugacca 
cuggcgacug 
gcgccggaua 
gcggcucuca 
uugcugcagc 
cccaaagugg 
cucgcaggau 
gccgcccuca 
ugguuagcgu 
gugggggcug 
uauggugcgg 
ucuauggaag 
ggggucaucu 
uggaugaaca 
gugacggagu 
agccuacuca 
uccuggcucc 
cugaccucua 
uacaagggug 
aucucuggca 
accuggcagg 
cccacgaacu 
cagcaugggu 
caacuaccuu 
cccacaccaa 
gcugucgggu 
cuaacagauc 
ccuccaucug 
ugcaccaccc 
ggcggugugg 
auggccgagg 
agcggguuuc 
gaaucgugga 



ggcucuuccg 
uugagacacu 
cugugcccca 
ccaagguccc 
uagcugccac 
uuaggacugg 
aauuucucgc 
gccacgcugu 
cagccggggu 
cccaucccga 
gggcgauucc 
aaaaguguga 
auagaggguu 
acgcccucau 
ucacccaagc 
cacaagacgc 
cuuauaggua 
gugagugcua 
ggcuuagagc 
gggaggcagu 
aagcggggga 
aggccccucc 
uugcgggccc 
cacacccugg 
gcacgugggu 
gaugcguuuc 
aggagguccu 
ucgaagaggg 
aggccucuaa 
aacaauuuug 
ugucaacacu 
ccaguccguu 
cccagaucgc 
ccgugggcag 
gcauuucggg 
augucaucaa 
gcgcggccau 
ggcuuauugc 
cggaugcguc 
gaagacucca 
gcgacgugug 
aauuguuccc 
ugugggccgg 
auguccgccu 
ggaccuuucc 
acaagaccgc 
cguacuccua 
cuccagaguu 
agccguuuuu 
cccagcuucc 
cgccccacau 
aggcgagcuc 
acagcaacac 
cucagacaga 
aagagagcga 
cacgggccuu 
ggaggccaga 



agcagcugug 
cgacguuguu 
gaccuaucag 
ugucgcguau 
ccugggguuu 
agucaggacc 
cgaugggggc 
ggaugcuacc 
cagacuaacu 
uauagaaaag 
ccuauccugc 
cgagcucgcg 
ggacgucucc 
gacgggguac 
ugucgacuuc 
ugucucacgc 
uguuuccacu 
cgacgcaggg 
guauuucaac 
uuucaccggc 
gaacuucgcg 
cccguccugg 
cacaccucuc 
gacgaaguac 
ccuagcugga 
caucaucggc 
guaugaggcu 
gcagcggaua 
gcaggcccag 
ggccagacac 
gccagggaac 
gucgaccagu 
accacccgcg 
cauaggccug 
ggcccucguc 
ucuacugccu 
ucugcgccgc 
cuuugcuucc 
gcagcgugug 
caauuggaua 
ggacuggguu 
caagcugccc 
cacuggcauc 
gggcucuaug 
uaucaauugc 
caucuggagg 
uguaacagga 
uuucuccugg 
ccgggaugag 
cugugaaccu 
cacggcggag 
cucagugagc 
cuaugacgug 
gccugagucc 
ccuugagccc 
accggcuugg 
uuaccaaccg 



ugcucucggg 
acaaggucuc 
gucggguacu 
gccgcccagg 
ggggcguacc 
gugaugaccg 
ugcgcuagcg 
uccauucucg 
gugcuggcua 
guaggccucg 
aucaagggag 
gcggcccuuc 
auaauaccag 
acuggagacu 
agccuggacc 
agucagcgcc 
ggugaacgag 
gcugcguggu 
acgcccggcc 
cucacacaca 
uaccuaguag 
gacgccaugu 
cuguaccguu 
aucgccacau 
ggaguccugg 
cgcuugcacg 
uuugaugaga 
gccgagaugu 
gacauacaac 
auguggaacu 
cccgcggugg 
accaccaucc 
ggggccaccg 
gguaaggugc 
gcauucaaga 
gggauccugu 
cacgugggac 
agaggaaacc 
acccaacuac 
acugaggacu 
ugcaccaucu 
ggccuccccu 
augaccacgc 
aggaucacag 
uacacggagg 
guggcggccu 
cugaccacug 
guggacggug 
gucucguucu 
gagcccgacg 
acugcggcgc 
cagcuaucag 
gacauggucg 
agggugcccg 
ucaauaccau 
gcacggccug 
cccaccguug 



gcguggccaa 
ccacuuucag 
ugcaugcucc 
gguacaaagu 
uauccaaggc 
gggaggccau 
gcgccuauga 
gcaucggaac 
cggccacacc 
ggcgggaggg 
ggagacaccu 
ggggcauggg 
cucagggaga 
uugacuccgu 
ccaccuucac 
gcgggcgcac 
ccucaggaau 
acgaucucac 
uacccgugug 
uagacgccca 
ccuaccaagc 
ggaagugccu 
ugggcccuau 
gcaugcaagc 
cagccgucgc 
ucaaccagcg 
uggaggaaug 
ugaaguccaa 
ccgcuaugca 
ucauuagcgg 
cuuccaugau 
uucucaacau 
gcuuugucgu 
ugguggacau 
ucaugucugg 
cuccgggagc 
cgggggaggg 
acgucgcccc 
uuggcucucu 
gccccauccc 
ugacagacuu 
ucaucucuug 
gcugcccuug 
ggccuaaaac 
gccagugcgc 
cggaguacgc 
acaaucugaa 
ugcagaucca 
gcguugggcu 
cagacguauu 
ggcgcuuggc 
caccgucgcu 
augccaaccu 
uucuggacuu 
cggagugcau 
acuacaaccc 
cugguugugc 



auccaucgau 
ugacaacagc 
aacuggcagu 
acuagugcuu 
acauggcauc 
cacguacucc 
caucaucaua 
gguccuugau 
ccccggguca 
ugagaucccc 
gauuuucugc 
cuugaaugcc 
uguggugguc 
gaucgacugc 
uauaaccaca 
agguagagga 
guuugacagu 
accagcggag 
ucaagaccau 
cuuccucucc 
uacggugugc 
ggcccgacuc 
uaccaaugag 
ugaccuugag 
cgcauauugc 
agucgucguu 
cgccucuagg 
gauccaaggc 
ggcuucaugg 
cauccaauac 
ggcauucagu 
caugggaggc 
caguggccug 
ccuggcagga 
cgagaagccc 
ccugguggug 
cgcgguccaa 
uacucacuac 
uacuauaacc 
augcuccgga 
caaaaauugg 
ucaaaagggg 
cggcgccaac 
cugcaugaac 
gccgaaaccc 
ggaggugacg 
aauuccuugc 
uagguuugca 
uaauuccuau 
gagguccaug 
acggggauca 
gcgggccacc 
gcucauggag 
ucucgagcca 
gcuccccagg 
gccgcucgug 
ucuccccccc 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
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cccaagaagg 
accauaucag 
ggugaugcag 
ggugagccgg 
ccuggagauc 
gggguagcuc 
accgugugcu 
gaagaggaaa 
guguacugua 
acgcaagugc 
aaggucagcg 
gcaagaucca 
aaccacauca 
accaucaugg 
gcucgccuca 
gacauuacac 
ccugcccaac 
uuuucguaug 
gaguccauau 
acugagagac 
agacguugcc 
gugaaagccc 
ggcaaugacc 
agagccuuca 
gaauaugacc 
cggggccgcc 
ugggaaacag 
ccaaccauau 
gacacccugg 
uuggaccuuc 
uacucucacc 
cucagggugu 
aaagcggccg 
acuccauugc 
gggggcgaca 
cuccuacuuu 
uagguacacu 
uuuuuuuuuu 
uuucuuggug 
augacugcag 



ccccgacgcc 
aagcccucca 
gcucguccac 
cccccucaga 
cggaccugga 
ccgguucggg 
gcuccauguc 
aguugccaau 
caacaucaaa 
ucgacgccca 
caaggcuccu 
aguauggauu 
aguccgugug 
ccaaaaauga 
ucguuuaccc 
aaaagcuucc 
ggguggagua 
auacccgaug 
accaggccug 
uuuacguagg 
gcgccagcgg 
uagcggccug 
uaguagucau 
cggaggccau 
uggagcuaau 
gcagauacua 
uuagacacuc 
ggguucgcau 
accagaaccu 
cagccauaau 
acgaacugac 
ggaagagucg 
uuugcggccg 
cggaggcgcg 
uuuuucacag 
ucguaggggu 
ccauagcuaa 
cuuuuuuuuu 
gcuccaucuu 
agagugccgu 



ucccccaagg 
gcaacuggcc 
gggggcgggc 
gacagguucc 
gucugaucag 
cucggggucu 
auacuccugg 
caacccuuug 
gagcgccuca 
uuaugacuca 
caccuuggag 
cggggccaag 
gaaggaccuc 
gguguucugc 
ugaccucggc 
ucaggcggua 
ucucuugaaa 
cuucgacuca 
cucccugccc 
agggcccaug 
ggugcuaacc 
caaggcugcg 
cucagaaagc 
gaccagguac 
aacauccugu 
ccugaccaga 
cccuaucaau 
gguccuaaug 
caacuuugag 
ugagagguua 
gcggguggcu 
ggcucgcgca 
auaucucuuc 
ccuacuggac 
cgugucgcgc 
aggccucuuc 
cuguuccuuu 
uuuuucccuc 
agcccuaguc 
aacuggucuc 



agacgccgga 
aucaagaccu 
gccgccgaau 
gccuccucua 
guagagcuuc 
uggucuacuu 
accggggcuc 
aguaacucgc 
cagagggcua 
gucuuaaagg 
gaggcgugcc 
gagguccgca 
cuggaagacc 
guggaccccg 
guccgggucu 
augggagcuu 
gcaugggcgg 
accgucacug 
gaggaggccc 
uucaacagca 
acuagcaugg 
gggauaguug 
caggggacug 
ucugccccuc 
uccucaaaug 
gacccaacca 
ucauggcugg 
acacacuucu 
auguauggau 
cacgggcuug 
ucagcccuca 
gucagggcgu 
aauugggcgg 
uuauccaguu 
gcccgacccc 
cuacuccccg 
uuuuuuuuuu 
uuucuucccu 
acggcuagcu 
ucugcagauc 



cagugggucu 
uuggccagcc 
ccggcggucc 
ugcccccccu 
aaccuccccc 
gcuccgagga 
uaauaacucc 
uguugcgaua 
aaaagguaac 
acaucaagcu 
aguugacucc 
gcuuguccgg 
cacaaacacc 
ccaagggggg 
gcgagaaaau 
ccuauggcuu 
aaaagaagga 
agagagacau 
gcacugccau 
agggucaaac 
guaacaccau 
cgcccacaau 
aggaggacga 
cuggugaucc 
ugucuguggc 
cuccacucgc 
gaaacaucau 
ucuccauucu 
caguauacuc 
acgccuuuuc 
gaaaacuugg 
cccucaucuc 
ugaagaccaa 
gguucaccgu 
gcucauuacu 
cucgguagag 
uuuuuuuuuu 
ucucaucuua 
gugaaagguc 
augu 



gagcgagagc 
ccccucgagc 
gacguccccu 
cgagggggag 
ccaggggggg 
ggacgauacc 
cuguagcccc 
ccauaacaag 
uuuugacagg 
agcggcuucc 
accccauucu 
gagggccguu 
aauucccaca 
uaagaaacca 
ggcccucuau 
ccaguacucc 
ccccaugggu 
caggaccgag 
acacucgcug 
cugcgguuac 
cacaugcuau 
gcugguaugc 
gcggaaccug 
ccccagaccg 
guugggcccg 
ccgggcugcc 
ccaguaugcu 
caugguccaa 
cgugaauccu 
uaugcacaca 
ggcgccaccc 
ccguggaggg 
gcucaaacuc 
cggcgccggc 
cuucggccua 
cggcac.acac 
uuuuuuuuuu 
uucuacuuuc 
cgugagccgc 



5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8024 



<210> 8 
<211> 7994 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 



<400> 8 

accugccccu 

cuucacgcag 

cccccucccg 

aagacugggu 

caagacugcu 

cgcuugcgag 



aauaggggcg 
aaagcgccua 
ggagagccau 
ccuuucuugg 
agccgaguag 
ugccccggga 



acacuccgcc 
gccauggcgu 
aguggucugc 
auaaacccac 
cguuggguug 
ggucucguag 



augaaucacu 
uaguaugagu 
ggaaccggug 
ucuaugcccg 
cgaaaggccu 
accgugcacc 



ccccugugag 
gucguacagc 
aguacaccgg 
gccauuuggg 
ugugguacug 
augagcacaa 



gaacuacugu 60 
cuccaggccc 120 
aauugccggg 180 
cgugcccccg 240 
ccugauaggg 300 
auccuaaacc 360 
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ucaaagaaaa 
cgcagguucu 
aaucggcugc 
ugucaagacc 
guggcuggcc 
aagggacugg 
uccugccgag 
ggcuaccugc 
ggaagccggu 
cgaacuguuc 
uggcgaugcc 
cuguggccgg 
ugcugaagag 
ucccgauucg 
ccucucccuc 
cguuugucua 
aaccuggccc 
ugcaaggucu 
caacgucugu 
gcggccaaaa 
uugugaguug 
ggcugaagga 
caugcuuuac 
acgugguuuu 
caaacacgag 
caggccgggg 
ucggggguuu 
gguccgguca 
ccugggacca 
cggaacgcug 
ccgagaccca 
cacgucguug 
uucauccccg 
acgccaccgg 
ggaaagagca 
aaccccucgg 
aaucccaaca 
acauauggca 
ugcgaugaau 
caagcagaga 
gugacaaccc 
uucuauggga 
cacucaaaga 
guggcauacu 
gucgccaccg 
aauguagcgg 
cagacugucc 
agacagggca 
guagugcuuu 
accaccguca 
cuugaauuuu 
caaacaaagc 
gccagagcca 
aagccuacgc 
gucacccuca 
gucaugacca 
cuggcgacug 



accaaaagaa 
ccggccgcuu 
ucugaugccg 
gaccuguccg 
acgacgggcg 
cugcuauugg 
aaaguaucca 
ccauucgacc 
cuugucgauc 
gccaggcuca 
ugcuugccga 
cugggugugg 
cuuggcggcg 
cagcgcaucg 
cccccccccu 
uauguuauuu 
ugucuucuug 
guugaauguc 
agcgacccuu 
gccacgugua 
gauaguugug 
ugcccagaag 
auguguuuag 
ccuuugaaaa 
gccuccuggg 
aaguccaaau 
uguggacugu 
cgcagaugua 
agucuuugga 
augucauccc 
uuucgaccuu 
ggcucuuccg 
uugagacacu 
cugugcccca 
ccaagguccc 
uagcugccac 
uuaggacugg 
aauuucucgc 
gccacgcugu 
cagccggggu 
cccaucccga 
gggcgauucc 
aaaaguguga 
auagaggguu 
acgcccucau 
ucacccaagc 
cacaagacgc 
cuuauaggua 
gugagugcua 
ggcuuagagc 
gggaggcagu 
aagcggggga 
aggccccucc 
uugcgggccc 
cacacccugg 
gcacgugggu 
gaugcguuuc 



acaccaaccg 
ggguggagag 
ccguguuccg 
gugcccugaa 
uuccuugcgc 
gcgaagugcc 
ucauggcuga 
accaagcgaa 
aggaugaucu 
aggcgcgcau 
auaucauggu 
cggaccgcua 
aaugggcuga 
ccuucuaucg 
aacguuacug 
uccaccauau 
acgagcauuc 
gugaaggaag 
ugcaggcagc 
uaagauacac 
gaaagaguca 
guaccccauu 
ucgagguuaa 
acacgaugau 
cgccauagug 
ccuguccaca 
uuaccacgga 
cucgagugcu 
gccgugcaag 
ggcucggaga 
gaaggggucc 
agcagcugug 
cgacguuguu 
gaccuaucag 
ugucgcguau 
ccugggguuu 
agucaggacc 
cgaugggggc 
ggaugcuacc 
cagacuaacu 
uauagaagag 
ccuauccugc 
cgagcucgcg 
ggacgucucc 
gacgggguac 
ugucgacuuc 
ugucucacgc 
uguuuccacu 
cgacgcaggg 
guauuucaac 
uuucaccggc 
gaacuucgcg 
cccguccugg 
cacaccucuc 
gacgaaguac 
ccuagcugga 
caucaucggc 



ucgcccaaug 
gcuauucggc 
gcugucagcg 
ugaacugcag 
agcugugcuc 
ggggcaggau 
ugcaaugcgg 
acaucgcauc 
ggacgaagag 
gcccgacggc 
ggaaaauggc 
ucaggacaua 
ccgcuuccuc 
ccuucuugac 
gccgaagccg 
ugccgucuuu 
cuaggggucu 
caguuccucu 
ggaacccccc 
cugcaaaggc 
aauggcucuc 
guaugggauc 
aaaaacgucu 
accauggcuc 
gugaguauga 
gucucucagu 
gcuggcaaca 
gagggggacu 
uguggagccg 
cgcggggaca 
ucgggggggc 
ugcucucggg 
acaaggucuc 
gucggguacu 
gccgcccagg 
ggggcguacc 
gugaugaccg 
ugcgcuagcg 
uccauucucg 
gugcuggcua 
guaggccucg 
aucaagggag 
gcggcccuuc 
auaauaccag 
acuggagacu 
agccuggacc 
agucagcgcc 
ggugaacgag 
gcugcguggu 
acgcccggcc 
cucacacaca 
uaccuaguag 
gacgccaugu 
cuguaccguu 
aucgccacau 
ggaguccugg 
cgcuugcacg 



auugaacaag 
uaugacuggg 
caggggcgcc 
gacgaggcag 
gacguuguca 
cuccugucau 
cggcugcaua 
gagcgagcac 
caucaggggc 
gaggaucucg 
cgcuuuucug 
gcguuggcua 
gugcuuuacg 
gaguucuucu 
cuuggaauaa 
uggcaaugug 
uuccccucuc 
ggaagcuucu 
accuggcgac 
ggcacaaccc 
cucaagcgua 
ugaucugggg 
aggccccccg 
ccaucacugc 
cggggcguga 
ccuuccucgg 
agacucuagc 
ugguaggcug 
ucgaccuaua 
agcggggagc 
cggugcucug 
gcguggccaa 
ccacuuucag 
ugcaugcucc 
gguacaaagu 
uauccaaggc 
gggaggccau 
gcgccuauga 
gcaucggaac 
cggccacacc 
ggcgggaggg 
ggagacaccu 
ggggcauggg 
cucagggaga 
uugacuccgu 
ccaccuucac 
gcgggcgcac 
ccucaggaau 
acgaucucac 
uacccgugug 
uagacgccca 
ccuaccaagc 
ggaagugccu 
ugggcccuau 
gcaugcaagc 
cagccgucgc 
ucaaccagcg 



auggauugca 
cacaacagac 
cgguucuuuu 
cgcggcuauc 
cugaagcggg 
cucaccuugc 
cgcuugaucc 
guacucggau 
ucgcgccagc 
ucgugaccca 
gauucaucga 
cccgugauau 
guaucgccgc 
gaguuuaaac 
ggccggugug 
agggcccgga 
gccaaaggaa 
ugaagacaaa 
aggugccucu 
cagugccacg 
uucaacaagg 
ccucggugca 
aaccacgggg 
uuaugcccag 
caggacagaa 
aacaaccauc 
cggcuuacgg 
gcccagcccc 
ucuggucacg 
auugcucucc 
cccuaggggc 
auccaucgau 
ugacaacagc 
aacuggcagu 
acuagugcuu 
acauggcauc 
cacguacucc 
caucaucaua 
gguccuugau 
ccccggguca 
ugagaucccc 
gauuuucugc 
cuugaaugcc 
uguggugguc 
gaucgacugc 
uauaaccaca 
agguagagga 
guuugacagu 
accagcggag 
ucaagaccau 
cuuccucucc 
uacggugugc 
ggcccgacuc 
uaccaaugag 
ugaccuugag 
cgcauauugc 
agucgucguu 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 
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gcgccggaua 
gcggcucuca 
uugcugcagc 
cccaaagugg 
cucgcaggau 
gccgcccuca 
ugguuagcgu 
gugggggcug 
uauggugcgg 
ucuauggaag 
ggggucaucu 
uggaugaaca 
gugacggagu 
agccuacuca 
uccuggcucc 
cugaccucua 
uacaagggug 
aucucuggca 
accuggcagg 
cccacgaacu 
cagcaugggu 
caacuaccuu 
cccacaccaa 
gcugucgggu 
cuaacagauc 
ccuccaucug 
ugcaccaccc 
ggcggugugg 
auggccgagg 
agcggguuuc 
gaaucgugga 
cccaagaagg 
accauaucag 
ggugaugcag 
ggugagccgg 
ccuggagauc 
gggguagcuc 
accgugugcu 
gaagaggaaa 
guguacugua 
acgcaagugc 
aaggucagcg 
gcaagaucca 
aaccacauca 
accaucaugg 
gcucgccuca 
gacauuacac 
ccugcccaac 
uuuucguaug 
gaguccauau 
acugagagac 
agacguugcc 
gugaaagccc 
caggggacug 
ucugccccuc 
uccucaaaug 
gacccaacca 



aggagguccu 
ucgaagaggg 
aggccucuaa 
aacaauuuug 
ugucaacacu 
ccaguccguu 
cccagaucgc 
ccgugggcag 
gcauuucggg 
augucaucaa 
gcgcggccau 
ggcuuauugc 
cggaugcguc 
gaagacucca 
gcgacgugug 
aauuguuccc 
ugugggccgg 
auguccgccu 
ggaccuuucc 
acaagaccgc 
cguacuccua 
cuccagaguu 
agccguuuuu 
cccagcuucc 
cgccccacau 
aggcgagcuc 
acagcaacac 
cucagacaga 
aagagagcga 
cacgggccuu 
ggaggccaga 
ccccgacgcc 
aagcccucca 
gcucguccac 
cccccucaga 
cggaccugga 
ccgguucggg 
gcuccauguc 
aguugccaau 
caacaucaaa 
ucgacgccca 
caaggcuccu 
aguauggauu 
aguccgugug 
ccaaaaauga 
ucguuuaccc 
aaaagcuucc 
ggguggagua 
auacccgaug 
accaggccug 
uuuacguagg 
gcgccagcgg 
uagcggccug 
aggaggacga 
cuggugaucc 
ugucuguggc 
cuccacucgc 



guaugaggcu 
gcagcggaua 
gcaggcccag 
ggccagacac 
gccagggaac 
gucgaccagu 
accacccgcg 
cauaggccug 
ggcccucguc 
ucuacugccu 
ucugcgccgc 
cuuugcuucc 
gcagcgugug 
caauuggaua 
ggacuggguu 
caagcugccc 
cacuggcauc 
gggcucuaug 
uaucaauugc 
caucuggagg 
uguaacagga 
uuucuccugg 
ccgggaugag 
cugugaaccu 
cacggcggag 
cucagugagc 
cuaugacgug 
gccugagucc 
ccuugagccc 
accggcuugg 
uuaccaaccg 
ucccccaagg 
gcaacuggcc 
gggggcgggc 
gacagguucc 
gucugaucag 
cucggggucu 
auacuccugg 
caacccuuug 
gagcgccuca 
uuaugacuca 
caccuuggag 
cggggccaag 
gaaggaccuc 
gguguucugc 
ugaccucggc 
ucaggcggua 
ucucuugaaa 
cuucgacuca 
cucccugccc 
agggcccaug 
ggugcuaacc 
caaggcugcg 
gcggaaccug 
ccccagaccg 
guugggcccg 
ccgggcugcc 



uuugaugaga 
gccgagaugu 
gacauacaac 
auguggaacu 
cccgcggugg 
accaccaucc 
ggggccaccg 
gguaaggugc 
gcauucaaga 
gggauccugu 
cacgugggac 
agaggaaacc 
acccaacuac 
acugaggacu 
ugcaccaucu 
ggccuccccu 
augaccacgc 
aggaucacag 
uacacggagg 
guggcggccu 
cugaccacug 
guggacggug 
gucucguucu 
gagcccgacg 
acugcggcgc 
cagcuaucag 
gacauggucg 
agggugcccg 
ucaauaccau 
gcacggccug 
cccaccguug 
agacgccgga 
aucaagaccu 
gccgccgaau 
gccuccucua 
guagagcuuc 
uggucuacuu 
accggggcuc 
aguaacucgc 
cagagggcua 
gucuuaaagg 
gaggcgugcc 
gagguccgca 
cuggaagacc 
guggaccccg 
guccgggucu 
augggagcuu 
gcaugggcgg 
accgucacug 
gaggaggccc 
uucaacagca 
acuagcaugg 
gggauaguug 
agagccuuca 
gaauaugacc 
cggggccgcc 
ugggaaacag 



uggaggaaug 
ugaaguccaa 
ccgcuaugca 
ucauuagcgg 
cuuccaugau 
uucucaacau 
gcuuugucgu 
ugguggacau 
ucaugucugg 
cuccgggagc 
cgggggaggg 
acgucgcccc 
uuggcucucu 
gccccauccc 
ugacagacuu 
ucaucucuug 
gcugcccuug 
ggccuaaaac 
gccagugcgc 
cggaguacgc 
acaaucugaa 
ugcagaucca 
gcguugggcu 
cagacguauu 
ggcgcuuggc 
caccgucgcu 
augccaaccu 
uucuggacuu 
cggagugcau 
acuacaaccc 
cugguugugc 
cagugggucu 
uuggccagcc 
ccggcggucc 
ugcccccccu 
aaccuccccc 
gcuccgagga 
uaauaacucc 
uguugcgaua 
aaaagguaac 
acaucaagcu 
aguugacucc 
gcuuguccgg 
cacaaacacc 
ccaagggggg 
gcgagaaaau 
ccuauggcuu 
aaaagaagga 
agagagacau 
gcacugccau 
agggucaaac 
guaacaccau 
cgcccacaau 
cggaggccau 
uggagcuaau 
gcagauacua 
uuagacacuc 



cgccucuagg 
gauccaaggc 
ggcuucaugg 
cauccaauac 
ggcauucagu 
caugggaggc 
caguggccug 
ccuggcagga 
cgagaagccc 
ccugguggug 
cgcgguccaa 
uacucacuac 
uacuauaacc 
augcuccgga 
caaaaauugg 
ucaaaagggg 
cggcgccaac 
cugcaugaac 
gccgaaaccc 
ggaggugacg 
aauuccuugc 
uagguuugca 
uaauuccuau 
gagguccaug 
acggggauca 
gcgggccacc 
gcucauggag 
ucucgagcca 
gcuccccagg 
gccgcucgug 
ucuccccccc 
gagcgagagc 
ccccucgagc 
gacguccccu 
cgagggggag 
ccaggggggg 
ggacgauacc 
cuguagcccc 
ccauaacaag 
uuuugacagg 
agcggcuucc 
accccauucu 
gagggccguu 
aauucccaca 
uaagaaacca 
ggcccucuau 
ccaguacucc 
ccccaugggu 
caggaccgag 
acacucgcug 
cugcgguuac 
cacaugcuau 
cucagaaagc 
gaccagguac 
aacauccugu 
ccugaccaga 
cccuaucaau 



3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
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ucauggcugg 
acacacuucu 
auguauggau 
cacgggcuug 
ucagcccuca 
gucagggcgu 
aauugggcgg 
uuauccaguu 
gcccgacccc 
cuacuccccg 
uuuuuuuuuu 
uuucuucccu 
acggcuagcu 
ucugcagauc 



gaaacaucau 
ucuccauucu 
caguauacuc 
acgccuuuuc 
gaaaacuugg 
cccucaucuc 
ugaagaccaa 
gguucaccgu 
gcucauuacu 
cucgguagag 
uuuuuuuuuu 
ucucaucuua 
gugaaagguc 
augu 



ccaguaugcu 
caugguccaa 
cgugaauccu 
uaugcacaca 
ggcgccaccc 
ccguggaggg 
gcucaaacuc 
cggcgccggc 
cuucggccua 
cggcacacac 
uuuuuuuuuu 
uucuacuuuc 
cgugagccgc 



ccaaccauau 
gacacccugg 
uuggaccuuc 
uacucucacc 
cucagggugu 
aaagcggccg 
acuccauugc 
gggggcgaca 
cuccuacuuu 
uagguacacu 
uuuuuuuuuu 
uuucuuggug 
augacugcag 



ggguucgcau 
accagaaccu 
cagccauaau 
acgaacugac 
ggaagagucg 
uuugcggccg 
cggaggcgcg 
uuuuucacag 
ucguaggggu 
ccauagcuaa 
cuuuuuuuuu 
gcuccaucuu 
agagugccgu 



gguccuaaug 
caacuuugag 
ugagagguua 
gcggguggcu 
ggcucgcgca 
auaucucuuc 
ccuacuggac 
cgugucgcgc 
aggccucuuc 
cuguuccuuu 
uuuuucccuc 
agcccuaguc 
aacuggucuc 



7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
7994 



<210> 9 
<211> 340 
<212> RNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic RNA 



<400> 9 

accugccccu aauaggggcg acacuccgcc 

cuucacgcag aaagcgccua gccauggcgu 

cccccucccg ggagagccau aguggucugc 

aagacugggu ccuuucuugg auaaacccac 

caagacugcu agccgaguag cguuggguug 

cgcuugcgag ugccccggga ggucucguag 



augaaucacu ccccugugag gaacuacugu 60 
uaguaugagu gucguacagc cuccaggccc 120 
ggaaccggug aguacaccgg aauugccggg 180 
ucuaugcccg gccauuuggg cgugcccccg 24 0 
cgaaaggccu ugugguacug ccugauaggg 300 
accgugcacc 340 



<210>. 10 
<211> 340 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 10 

acccgccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgucua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 24 0 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
ugcuugcgag ugccccggga ggucucguag accgugcacc 34 0 



<210> 11 
<211> 236 
<212> RNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic RNA 
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<400> 11 

agcggcacac acuagguaca cuccauagcu 

uuuuuuuuuu uuuuuuuuuu uucuuuuuuu 

uauucuacuu ucuuucuugg uggcuccauc 

uccgugagcc gcaugacugc agagagugcc 



aacuguuccu uuuuuuuuuu uuuuuuuuuu 60 
uuuuuuuccc ucuuucuucc cuucucaucu 120 
uuagcccuag ucacggcuag cugugaaagg 180 
guaacugguc ucucugcaga ucaugu 236 



<210> 12 
<211> 232 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 12 

agcggcacac auuagcuaca cuccauagcu aacuguuccu uuuuuuuuuu uuuuuuuuuu 60 
uuuuuuuuuu uuuuuuucuu uuuuuuuuuu uuucccucuu ucuucccuuc ucaucuuauu 120 
cuacuuucuu ucuugguggc uccaucuuag cccuggucac ggcuagcugu gaaagguccg 180 
ugagccgcau gacugcagag agugccguaa cuggucucuc ugcagaucau gu 232 



<210> 13 

<211> 17 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 13 

cgggagagcc atagtgg 17 



<210> 14 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 14 

agtaccacaa ggcctttcg 19 



<210> 15 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 15 

ctgcggaacc ggtgagtaca c 21 
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<210> 16 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 16 

aacaagatgg attgcacgca 

<210> 17 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 17 

cgtcaagaag gcgatagaag 

<210> 18 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 18 

gcactctctg cagtcatgcg gctcacggac 

<210> 19 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 19 

cccctgtgag gaactactgt cttcacgc 



<210> 20 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
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<400> 20 

ccgggagagc catagtggtc tgcg 



24 



<210> 21 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 21 

ccactcaaag aaaaagtgtg acgagctcgc 30 



<210> 22 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 



<210> 23 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 23 

gcggtgaaga ccaagctcaa actcactcca 30 



<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 



<400> 22 

ggcttgggca cggcctga 



18 



<400> 24 

agaacctgcg tgcaatccat c 



21 



<210> 25 
<211> 23 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: synthetic DNA 



<400> 25 

cccgtcatga gggcgtcggt ggc 



23 



<210> 26 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 26 

accagcaacg gtgggcggtt ggtaatc 27 



<210> 27 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 



<210> 28 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 28 

agctagccgt gactagggct aagatggagc 30 

<210> 29 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
DNA (primer) 



<400> 27 

ggcacgcgac acgctgtg 



18 



<400> 29 

aacaagatgg attgcacgca 



20 
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<210> 30 

<211> 20 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : synthetic 
DNA (primer) 

<400> 30 

cgtcaagaag gcgatagaag 

<210> 31 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic DNA 
<400> 31 

gcactctctg cagtcatgcg gctcacggac 

<210> 32 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic DNA 
<400> 32 

cccctgtgag gaactactgt cttcacgc 

<210> 33 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
<400> 33 

ccgggagagc catagtggtc tgcg 



<210> 34 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
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<400> 34 

ccactcaaag aaaaagtgtg acgagctcgc 

<210> 35 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
DNA (primer) 

<400> 35 

ggcttgggca cggcctga 

<210> 36 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
<400> 36 

gcggtgaaga ccaagctcaa actcactcca 



<210> 37 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 

<400> 37 

agaacctgcg tgcaatccat c 



<210> 38 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 

<400> 38 

cccgtcatga gggcgtcggt ggc 

<210> 39 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
<400> 39 

accagcaacg gtgggcggtt ggtaatc 27 



<210> 40 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
<400> 40 

ggaacgcgac acgctgtg 18 



<210> 41 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
<400> 41 

agctagccgt gactagggct aagatggagc 30 



t 

I 

I 
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